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Sir: 

This is in response to the Office Action (Paper No. 12) mailed herein on January 17, 
2003, a response to which is due on April 17, 2003. Enclosed herewith is a Petition for an 
Extension of Time under 37 C.F.R. § 1. 136(a) for a response for a period of one (1) month from 
April 17, 2003, up to and including May 17, 2003, and the fee therefor, thereby making this 
a timely response. 

The Commissioner is authorized to charge any fees which may be required, including 
extension fees, or credit any overpayment to Deposit Account No. 07-0630. 
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In the Claims: 

Please cancel Claims 1-52 without prejudice or disclaimer. 
Please add new claims 53-69. 



-53. (New) An isolated nucleic acid molecule having at least 80% sequence identity to: 

(a) a nucleic acid sequence encoding the polypeptide shown as SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide shown as SEQ ID NO:2, 
lacking its associated signal peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO: 1; 

(d) the full-length coding sequence of the nucleic acid sequence shown as SEQ ID 
NO:l;or 

(e) the full-length coding sequence of cDNA deposited under ATCC accession 
number PTA- 1532; wherein said encoded polypeptide is a receptor for and binds 
to the ligand polypeptide shown as SEQ ID NO:4. 

54. (New) An isolated nucleic acid molecule having at least 85% sequence identity to: 

(a) a nucleic acid sequence encoding the polypeptide shown as SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide shown as SEQ ID NO:2, 
lacking its associated signal peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO: 1; 

(d) the full-length coding sequence of the nucleic acid sequence shown as SEQ ID 
NO:l; or 

(e) the full-length coding sequence of cDNA deposited under ATCC accession 
number PTA- 1532; wherein said encoded polypeptide is a receptor for and binds 
to the ligand polypeptide shown as SEQ ID NO:4. 

55. (New) An isolated nucleic acid molecule having at least 90% sequence identity to: 

(a) a nucleic acid sequence encoding the polypeptide shown as SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide shown as SEQ ID NO:2, 
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lacking its associated signal peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO: 1; 

(d) the full-length coding sequence of the nucleic acid sequence shown as SEQ ID 
NO:l; or 

(e) the full-length coding sequence of cDNA deposited under ATCC accession 
number PTA- 1532; wherein said encoded polypeptide is a receptor for and binds 
to the ligand polypeptide shown as SEQ ID NO:4. 

56. (New) An isolated nucleic acid molecule having at least 95% sequence identity to: 

(a) a nucleic acid sequence encoding the polypeptide shown as SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide shown as SEQ ID NO:2, 
lacking its associated signal peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO: 1; 

(d) the full-length coding sequence of the nucleic acid sequence shown as SEQ ID 
NO:l; or 

(e) the full-length coding sequence of cDNA deposited under ATCC accession 
number PTA- 1532; wherein said encoded polypeptide is a receptor for and binds 
to the ligand polypeptide shown as SEQ ID NO:4. 

57. (New) An isolated nucleic acid molecule having at least 99% sequence identity to: 

(a) a nucleic acid sequence encoding the polypeptide shown as SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide shown as SEQ ID NO:2, 
lacking its associated signal peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO: 1; 

(d) the full-length coding sequence of the nucleic acid sequence shown as SEQ ID 
NO:l;or 

(e) the full-length coding sequence of cDNA deposited under ATCC accession 
number PTA-1532;wherein said encoded polypeptide is a receptor for and binds 
to the ligand polypeptide shown as SEQ ID NO:4. 
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58. (New) An isolated nucleic acid molecule comprising: 

(a) a nucleic acid sequence encoding the polypeptide shown as SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide shown as SEQ ID NO:2, 
lacking its associated signal peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO: 1; 

(d) the lull-length coding sequence of the nucleic acid sequence shown as SEQ ID 
NO:l; or 

(e) the full-length coding sequence of cDNA deposited under ATCC accession 
number PTA-1532. 

59. (New) The isolated nucleic acid molecule of Claim 58 comprising a nucleic acid 
sequence encoding the polypeptide shown as SEQ ID NO:2. 



J 60. (New) The isolated nucleic acid molecule of Claim 58 comprising a nucleic acid 

sequence encoding the polypeptide shown as SEQ ID NO:2, lacking its associated signal 
peptide. 

61. (New) The isolated nucleic acid molecule of Claim 58 comprising the nucleic acid 
sequence shown as SEQ ID NO:l. 

62. (New) The isolated nucleic acid molecule of Claim 58 comprising the full-length coding 
sequence of the nucleic acid sequence shown as SEQ ID NO:l. 

63. (New) The isolated nucleic acid molecule of Claim 58 comprising the full-length coding 
sequence of cDNA deposited under ATCC accession number PTA- 1532. 

^ (New) An isolated nucleic acid molecule that hybridizes to: 

(a) a nucleic acid sequence encoding the polypeptide shown as SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide shown as SEQ ID NO:2, 
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lacking its associated signal peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO: 1; 

(d) the full-length coding sequence of the nucleic acid sequence shown as SEQ ID 
NO:l; or 

(e) the full-length coding sequence of cDNA deposited under ATCC accession 
number PTA-1532. 

65. (New) The isolated nucleic acid molecule of Claim 64, wherein said hybridization 
occurs under stringent conditions of 50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M 
sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x 
Denhardt's solution, sonicated salmon sperm DNA (50 /ig/ml), 0.1% SDS, and 10% 
dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC (sodium chloride/sodium 
citrate) and 50% formamide at 55 °C, followed by a high-stringency wash consisting of 0. 1 
x SSC containing EDTA at 55°C. 



f 



66. (New) A vector comprising the nucleic acid molecule of ClainW. 

67. (New) The vector of Claim 66, wherein said nucleic acid is operably linked to control 
sequences recognized by a host cell transformed with the vector. 

68. (New) ' A host cell comprising the vector of Claim 66. 

69. (New) The host cell of Claim 68, wherein said cell is a CHO cell, an E. coli or a yeast 
cell.-- 
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REMARKS 

Claims 1-52 remain pending in the application for consideration. Claims 1-17 are 
under consideration subject to Applicants' restriction requirement election. Applicants have 
herein cancelled Claims 1-52 and added new Claims 53-69 so as to more clearly define the 
subject matter claimed therein. In that the newly added claims do not introduce new subject 
matter and are supported in the specification as originally filed, their entry is respectfully 
requested. Specification support for the newly added Claims 53-69 can be found at least as 
follows: page 2, lines 14-22, 25-27; page 3, lines 1-37; page 4, lines 1-12, 24-36; page 6, lines 
31-37; page 7, lines 1-35; page 8, lines 8-20; page 9, lines 26-30; page 18, lines 1-4; page 52, 
lines 28-37; and EXAMPLES 1, 11, 12 and 13; and the claims as originally filed. 

Information Disclosure Statement 

Applicant has been invited to submit a new PTO-1449 form listing the foreign patent 
documents, with dates listed, for consideration. A new PTO-1449 form is herein submitted 
listing the missing dates which were previously inadvertently omitted. 

Objections and Rejections under 35 U.S.C. $112 Second Paragraph 

Claims 4, 9, 10, 11, 12, 16 and 17 stand rejected under 35 U.S.C. §112, second 
paragraph, as allegedly being indefinite for failing to particularly point out and distinctly 
claim the subject matter which Applicants regard as the invention. Without necessarily 
agreeing with the propriety of the rejection, Claims 1-52 have been cancelled and new Claims 
53-69 have been added so as to more clearly define the subject matter. The newly added 
claims correct typographical errors in originally filed Claims 4, 9, 10 and 16. In light of newly 
added Claims 53-69, Applicants respectfully submit that the indefinite rejection of original 
Claim 17 has been overcome. In addition, the Examiner has objected to the recitation of 
"stringent conditions" in Claims 11 and 12, without reciting specific conditions in the original 
claims. Accordingly, newly added Claims 64-65 have been amended to indicate the specific 
stringent hybridization conditions. In light of the amendment to the claims and the discussion 
above, Applicants respectfully request reconsideration and withdrawal of the outstanding 
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rejections under 35 U.S.C. §112 second paragraph. 

Objections and Rejections under 35 U.S.C. $101 

Claims M7 stand rejected under 35 U.S.C. §101 for allegedly lacking a credible, 
specific and substantial asserted utility or a well established utility. In light of the claim 
amendments made herein and for the reasons set forth below, Applicants respectfully traverse. 

Utility - Legal Standard 

According to the Utility Examination Guidelines ("Utility Guidelines"), 66 Fed. Reg. 
1092 (2001) an invention complies with the utility requirement of 35 U.S.C. § 101, if it has at 
least one asserted "specific, substantial, and credible utility" or a "well-established utility." 
Under the Utility Guidelines, a utility is "specific" when it is particular to the subject matter 
claimed. For example, it is generally not enough to state that a polypeptide is useful as a 
diagnostic without also identifying the condition that is to be diagnosed. The requirement of 
"substantial utility" defines a "real world" use, and derives from the Supreme Court's holding 
in Brenner v. Manson, 383 U.S. 519, 534 (1966) stating that "The basic quid pro quo 
contemplated by the Constitution and the Congress for granting a patent monopoly is the 
benefit derived by the public from an invention with substantial utility." In explaining the 
"substantial utility" standard, M.P.E.P. 2107.01 cautions, however, that Office personnel must 
be careful not to interpret the phrase "immediate benefit to the public" or similar formulations 
used in certain court decisions to mean that products or services based on the claimed 
invention must be "currently available" to the public in order to satisfy the utility requirement. 
"Rather, any reasonable use that an applicant has identified for the invention that can be 
viewed as providing a public benefit should be accepted as sufficient, at least with regard to 
defining a "substantial" utility." (M.P.E.P. 2107.01, emphasis added.) Indeed, the Guidelines 
for Examination of Applications for Compliance With the Utility Requirement, set forth in 
M.P.E.P, 2107 II (B) (1) gives the following instruction to patent examiners: "If the applicant 
has asserted that the claimed invention is useful for any particular practical purpose . . . and 
the assertion would be considered credible by a person of ordinary skill in the art, do not 
impose a rejection based on lack of utility." 
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Finally, the Utility Guidelines restate the Patent Office's long established position that 
any asserted utility has to be "credible." "Credibility is assessed from the perspective of one 
of ordinary skill in the art in view of the disclosure and any other evidence of record . . . that 
is probative of the applicant's assertions." (M.P.E.P. 2107 II (B) (1) (ii)) Such standard is 
presumptively satisfied unless the logic underlying the assertion is seriously flawed, or if the 
facts upon which the assertion is based are inconsistent with the logic underlying the assertion 
(Revised Interim Utility Guidelines Training Materials, 1999). 

Proper Application of the Legal Standard 

Applicants submit that the microarray data provided in the present application is 
sufficient to establish a credible, specific and substantial utility for PRO3301polypeptides 
identified as SEQ ID NO:4. Furthermore, it is Applicants belief that the "universal normal 
control" described in EXAMPLE 13, is a reliable negative control and is predictive of normal 
expression levels found in healthy tissues of the same tissue type as the cancerous tumors 
examined. As such, microarray studies utilizing a "universal normal control" serve as a 
predictive indicator of overexpression or upregulation of gene expression in cancerous tumors. 
Applicants would like to draw the Examiner's attention to the fact that parallel studies were 
concurrently conducted, wherein the microarray analysis of the test sample was compared to 
expression levels exhibited in a control sample using normal healthy cells representative of 
the same tissue type as the cancerous samples examined. In this regard, these parallel studies 
compared the expression levels of PRO3301 polypeptide-encoding nucleic acid sequences in: 
(1) non-small cell lung adenocarcinoma versus normal lung tissue controls, (2) lung squamous 
cell carcinoma versus normal lung tissue controls, and (3) colorectal adenocarcinoma versus 
normal colorectal tissue controls. These contemporaneous studies demonstrated that compared 
to their normal healthy tissue counterparts, PRO3301 polypeptide-encoding nucleic acid 
sequences showed, on average, a 2.4 fold, 9.2 fold, and 5.3 fold increase in expression levels 
in non-small cell lung adenocarcinomas, lung squamous cell carcinomas, and colorectal 
adenocarcinomas, respectively. In light of the arguments and discussion above, PRO3301 
polypeptides of the present invention are significantly overexpressed in specific tumor tissue(s) 
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when compared to normal controls. Thus, PRO3301polypeptide-encoding nucleic acid 
sequences of the present invention can be useful as diagnostic markers for non-small cell lung 
adenocarcinomas, lung squamous cell carcinomas and colorectal adenocarcinomas. In view 
of these findings, the novel PRO 19598 receptor which binds to a functional PRO3301 
(diagnostic marker) would likewise possess a credible, specific and substantial utility. The 
binding event in and of itself can serve as a useful in vitro diagnostic tool to detect and 
measure the levels of PRO3301 polypeptides in an unknown test sample wherein detected 
overexpression levels would be suggestive of the presence of lung and colorectal tumors. It is, 
of course, true that further research would be needed to develop a diagnostic product. 
However, the fact that such follow-up tests might be necessary, cannot properly lead to the 
legal conclusion that either PRO3301 or PRO 19598 polypeptides or their corresponding 
encoding nucleic acids lack patentable utility. 

As set forth in M.P.E.P., 2107 II (B) (1), if the Applicant has asserted that the claimed 
invention is useful for any particular purpose, and the assertion would be considered credible 
by a person of ordinary skill in the art, a rejection based on lack of utility should not be 
imposed. Accordingly, the Examiner is respectfully requested to reconsider and withdraw the 
present rejection under 35U.S.C. §101. 

Objections and Rejections under 35 U.S.C. §112. First Paragraph 

(1) Claims 22-41 stand rejected under 35 U.S.C. §112, first paragraph as allegedly not 
being supported by either a credible, specific and substantial asserted utility or a well 
established utility so that one skilled in the art would not know how to use the claimed 
invention. Without necessarily agreeing with the propriety of the rejection, Applicants have 
herein canceled Claims 1-52 rendering the rejection of Claims 22-41 under 35 U.S.C. § 1 12, first 
paragraph in the instant Office Action moot. Accordingly, Applicants respectfully request that 
the outstanding rejection of Claims 22-41 under under 35 U.S.C. §112, first paragraph be 
withdrawn. 

(2) Claims 1, 5, 7 and 9-17 stand rejected under 35 U.S.C. §112, first paragraph, as 
allegedly, containing subject matter which was not described in the specification in such a way 
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as to reasonably convey to one skilled in the relevant art that the inventor(s), at the time the 
application was filed, had possession of the claimed invention. The Examiner further states 
"...only isolated polypeptides comprising the amino acid sequence set forth in SEQ ID NO:2, but 
not the full breadth of the claim meets the written description provision of 35 U.S.C. §112, first 
paragraph". Accordingly, the Examiner alleges that the specification does not provide 
adequate written description of the claimed genus. Applicants respectfully traverse. 

In light of the amendment to the claims and the discussion above traversing the 
rejections under 35 U.S.C. § 101, Applicants respectfully submit that the specification discloses 
a credible, specific and substantial utility for both PRO3301 and PR019598 polypeptides as 
useful diagnostic tools for detection of lung and colorectal tumors. Applicants have herein 
amended the rejected genus claims to recite variant nucleic acid sequences encoding a 
PRO 19598 polypeptide, wherein said encoded polypeptide is a receptor for and binds to the 
ligand polypeptide shown as SEQ ID NO:4. Accordingly, the genus claims, as amended, recite 
a specific utility for the claimed variant nucleic acids (i.e. said polypeptide serving as a specific 
receptor that binds to a functional ligand polypeptide). Since the claims are now drawn to a 
genus defined by both sequence and functional identity, one of ordinary skill in the art would 
know at the effective priority date of this application, that the Applicants were in possession 
of the claimed sequences. Accordingly, the Examiner is respectfully requested to reconsider 
and withdraw the present rejection of the claims under 35 U.S.C. §112, first paragraph. 

In light of the above amendments and remarks, Applicants believe that this application 
is now in condition for immediate allowance and respectfully request that the outstanding 
objections be withdrawn and this case passed to issue. 

The Examiner is invited to contact the undersigned at (650) 225-4563 if any issues may 
be resolved in that manner. 

Attached hereto is a marked-up version of the changes made to the claims by the 
current amendment. The attached page is captioned ' Version with markings to 
show changes made." 
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PATENT TRADEMARK OFFICE 



Respectfully submitted, 
GENENTECH, INC. 



By: 



Elizabeth M. Barnes, Ph.D. 
Reg. m35, 059 
Telephone: (650) 225-4563 
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Version with markings to show changes made 

In the Claims 

Claims 1-52 have been cancelled 
New Claims 53-69 have been added 
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22313-1450 on 

September 2, 2003 
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AMENDMENT UNDER 37 C.F.R. §1.111 

Commissioner for Patents 
P.O. Box 1450 
Alexandria, VA 2231 3-1 450 

Sir: 

This document is responsive to the Final Office Action mailed July 1 . 2003 (Paper No. 
1 5) for which a three-month period for response was given. This response is timely filed and 
the fee therefor. In view of the discussion provided, reconsideration and allowance are 
respectfully requested. 

The Commissioner is authorized to charge any fees which may be required, 
including extension fees, or credit any overpayment to Deposit Account No. 07-0630. 



Amendments to the Claims are reflected in the listing of claims which begins on page two 
(2) of this paper. 

Remarks begin on page seven (7) of this paper. 
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Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in the application: 
Listing of Claims : 

Claims 1-52 have been cancelled. 

53. (Currently amended) An isolated nucleic acid molecule having at least 80% 

sequence identity to: 

(a) a nucleic acid sequence encoding the polypeptide shown as 
SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide shown as 
SEQ ID NO:2, lacking 

its associated signal peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO:1; 

(d) the full-length coding sequence of the nucleic acid sequence 
shown as SEQ ID NO:l;or 

(e) the full-length coding sequence of cDNA deposited under ATCC accession 
number PTA-1532; wherein 3aid encoded po l yp e ptid e said isolated nucleic 
acid encodes a polypeptide which is a receptor for and binds to the ligand 
polypeptide shown as SEQ ID NO:4. 

54. (Currently amended) An isolated nucleic acid molecule having at least 85% 
sequence identity to: 

(a) a nucleic acid sequence encoding the polypeptide shown as 
SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide shown as 
SEQ ID NO:2, lacking its associated signal peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO:1 ; 

(d) the full-length coding sequence of the nucleic acid sequence 
shown as SEQ ID NO: 1; or 
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(e) the full-length coding sequence ofcDNA deposited under ATCC accession 
number PTA-1532; wherein 3aid encoded po l ypeptide said isolated nucleic 
acid encodes a polypeptide which is a receptor for and binds to the ligand 
polypeptide shown as SEQ ID NO:4. 

55. . (Currently amended) An isolated nucleic acid molecule having at least 90% 
sequence identity to: 

(a) a nucleic acid sequence encoding the polypeptide shown as 
SEQIDNO:2; 

(b) a nucleic acid sequence encoding the polypeptide shown as 
SEQ ID NO:2, lacking its associated signal peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO:l; 

(d) the full-length coding sequence of the nucleic acid sequence 
shown as SEQ ID NO:l;or 

(e) the full-length coding sequence of cDN A deposited under ATCC accession 
number PTA-1 532; wherein said encoded polypeptid e said isolated nucleic 
acid encod es a polypeptide which is a receptor for and binds to the ligand 
polypeptide shown as SEQ ID NO:4. 

56. (Currently amended) An isolated nucleic acid molecule having at least 95% 
sequence identity to: 

(a) a nucleic acid sequence encoding the polypeptide shown as 
SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide shown as 
SEQ ID NO:2, lacking its associated signal peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO:l ; 

(d) the full-length coding sequence of the nucleic acid sequence 
shown as SEQ ID NO:l;or 
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(e) the full-length coding sequence of cDNA deposited under ATCC accession 
number PTA-1532; wherein sa i d encod e d po l ypept i d e said isolated nucleic 
acid encod es a polypeptide which is a receptor for and binds to the ligand 
polypeptide shown as SEQ ID NO:4. 

57. (Currently amended) An isolated nucleic acid molecule having at least 99% 

sequence identity to: 

(a) a nucleic acid sequence encoding the polypeptide shown as 
SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide shown as 
SEQ ID NO:2, lacking its associated signal peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO:1; 

(d) the full-length coding sequence of the nucleic acid sequence 
shown as SEQ ID NO: 1; or 

(e) the full-length coding sequence of cDNA deposited under ATCC accession 
number PTA-1532; wherein said encoded polypeptid e said isolated nucleic 
acid encodes a polypeptide which is a receptor for and binds to the ligand 
polypeptide shown as SEQ ID NO:4. 

58. (Previously presented) An isolated nucleic acid molecule comprising: 

(a) a nucleic acid sequence encoding the polypeptide shown as 
SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide shown as 
SEQ ID NO:2, lacking its associated signal peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO:l ; 

(d) the full-length coding sequence of the nucleic acid sequence 
shown as SEQ ID NO:l;or 

(e) the full-length coding sequence of cDNA deposited under ATCC accession 
number PTA-1532. 
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59. (Previously presented) The isolated nucleic acid molecule of Claim 58 
comprising a nucleic acid sequence encoding the polypeptide shown as SEQ ID 
NO:2. 

60. (Previously presented) The isolated nucleic acid molecule of Claim 58 
comprising a nucleic acid sequence encoding fhe polypeptide shown as SEQ ID 
NO:2, lacking its associated signal peptide. 

61. (Previously presented) The isolated nucleic acid molecule of Claim 58 
comprising the nucleic acid sequence shown as SEQ ID NO:l . 

62. (Previously presented) The isolated nucleic acid molecule of Claim 58 
comprising the full-length coding sequence of the nucleic acid sequence shown as 
SEQ ID NO:1. 

63. (Previously presented) The isolated nucleic acid molecule of Claim 58 
comprising the full-length coding sequence of cDNA deposited under ATCC 
accession number PTA-1532. 

+64. (Currently amended) An isolated nucleic acid molecule that hybridizes to: 

(a) a nucleic acid sequence encoding the polypeptide shown as 
SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide shown as 
SEQ ID NO:2. lacking its associated signal peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO:1; 

(d) the full-length coding sequence of the nucleic acid sequence 
shown as SEQ ID NO:l;or 

(e) the full-length coding sequence of cDN A deposited under ATCC accession 
number PTA- 1 532 : wherein said isolated nucleic acid encodes a polypeptide 
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which is a receptor for and binds to the ligand polypeptide shown as SEQ ID 
NO:4.. 

65. (Previously presented) The isolated nucleic acid molecule of Claim 64, wherein 
said hybridization occurs under stringent conditions of 50% formamide, 5 x SSC (0.75 
M NaCI, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium 
pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50 pg/ml), 0. 1 % 
SDS, and 10% dextran sulfate at 42 C, with washes at 42 C in 0.2 x SSC (sodium 
chloride/sodium citrate) and 50% formamide at 55 C, followed by a high-stringency 
wash consisting of 0.1 x SSC containing EDTA at 55 C. 

66. (Currently amended) A vector comprising the isolated nucleic acid molecule 
of Claim +53. 

67. (Currently amended) The vector of Claim 66, wherein said isolated nucleic acid 
is operably linked to control sequences recognized by a host cell transformed with the 
vector. 

68. (Previously presented) A host cell comprising the vector of Claim 66. 

69. (Previously presented) The host cell of Claim 68, wherein said cell is a CHO cell, 
an E. coli or a yeast cell. 

REMARKS 

Claims 53-69 remain in this application. Claims 1-52 have been previously 
canceled. Applicants have herein amended Claims 53-57, 64, and 66-67 so as to 
more clearly define the subject matter claimed therein. In that the newly amended 
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claims do not introduce new subject matter and are supported in the specification 
and the claims as originally filed, their entry is respectfully requested. 

Oblections and Rejections Under 35 U.S.C. $ 112. Second Paragraph 

Claims 53-57 and 66-69 stand rejected under 35 U.S.C. § 1 1 2, second paragraph, 
as allegedly being indefinite for failing to particularly point out and distinctly claim the 
subject matter which Applicants regard as the invention. Claims 53-57 have been 
amended to correct an improper antecedent basis for the phrase "said encoded 
polypeptide" by deleting the phrase and replacing it with the phrase "said isolated 
nucleic acid encodes a polypeptide which" as suggested by the Examiner. Claim 66 
has been amended to depend from Claim 53 rather than from cancelled Claim 1 . In 
light of the amendments to the claims and the discussion above, Applicants respectively 
request reconsideration and withdrawal of the outstanding rejections under 35 U.S.C. § 
1 12, second paragraph. 

Rejections Under 35 U.S.C. § 101 

Claims 53-69 stand rejected under35 U.S.C. § 1 01 for allegedly lacking a credible, 
specific and substantial asserted utility or a well established utility. The Examiner has 
stated that it is unclear from Example 1 3 of the specification that the "universal normal 
control" is an appropriate control. However, the Examiner has indicated that the new 
data from parallel and concurrent studies that was submitted by Applicant in their 
response to the January 1 7, 2003 Office Action may be probative of the utility of the 
claimed invention. Yet, such new data is more properly presented in the form of a 
declaration under 37 C.F.R. §1.132. Applicants herein provide the Declaration of 
Thomas D. Wu, M.D., Ph.D., wherein Dr. Thomas D. Wu declares that in addition to using 
the above described "universal normal control" in these microarray studies, parallel 
studies were concurrently conducted. In these contemporaneous studies, the level of 
expression of the PRO3301 -encoding nucleic acid sequence was compared in both 
tumor samples and in control samples of the same tissue type as the tumor samples 
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examined. These concurrent studies demonstrated that compared to their normal 
healthy tissue counterparts, PRO3301 -encoding nucleic acid sequences showed, on 
average, a 2.4 fold, 9.2 fold, and 5.3 fold increase in expression levels in non-small cell 
lung adenocarcinomas, lung squamous cell carcinomas, and colorectal 
adenocarcinomas, respectively. These results demonstrate that PRO3301 -encoding 
nucleic acid sequences are significantly overexpressed in specific tumor tissue(s) when 
compared to their specific normal tissue controls and thus can be useful as a diagnostic 
marker for the presence or absence of such specific tumor(s) in a tissue sample of 
unknown pathology. As such, the PR0 19598 polypeptides encoded by the claimed 
polynucleotides have patentable utility in their being able to bind/label PRO3301 in 
cancer diagnostic assays. 

Applicants respectfully request the acceptance into evidence Dr. Thomas D. 
Wu's Declaration submitted herein under 37 C.F.R. § 1 . 1 32. In light of this declaration, 
amendment to the claims and the discussion above, Applicants respectfully request 
reconsideration and withdrawal of the outstanding rejections under 35 U.S.C. §101. 

Rejections Under 35 U.S.C. $ 112. First Paragraph 

Claims 53-69 are rejected under 35 U.S.C. § 1 1 2, first paragraph for allegedly 
lacking 

a credible, specific and substantial asserted utility or a well established utility for the 
same reasons outlined in the rejections under 35 U.S.C. §101. Applicants have herein 
provided the Declaration of Dr. Thomas D. Wu under 37 C.F.R. §1.132 to provide 
evidence and support for Applicants' assertion that the claimed invention is 
supported by a credible, specific and substantial asserted utility or a well established 
utility so that one skilled in the art would know how to use the claimed invention. In 
the light of this declaration and the arguments and discussion traversing the rejections 
under 35 U.S.C. § 101. Applicants respectfully submit that specification discloses a 
credible, specific and substantial utility for the claimed invention as useful diagnostic 
tools for detection of lung and colorectal tumors. 
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In addition. Claims 64-6? are rejected under 35 U.S.C. § 1 1 2. first paragraph for 
allegedly reciting a genus of polynucleotides without indicating functional limitations. 
The Examiner has stated that Claims 53-63. as previously amended recite structural 
and functional limitations for the encoded polypeptides resulting in an adequate 
written description of a claimed genus. Applicants have herein amended Claims 64 
and 66-67 to provide the functional limitation " wherein said isolated nucleic acid 
encodes a polypeptide which is a receptor for and binds to the liaand polypeptide 
shown as SEQ ID NQ:4" in order to provide an adequate written description of the 
claimed genus. Thus, the genus claims, as amended, recite a specific utility for the 
claimed variant nucleic acids which encode a polypeptide which is receptor for and 
binds to a functional ligand polypeptide. Since the claims are now drawn to a genus 
defined by both sequence and functional identity, one of ordinary skill in the art 
would know at the effective priority date of this application, that the Applicants were 
in possession of the claimed sequences. 

In view of the Declaration of Dr. Thomas D. Wu submitted herewith, 
amendments to the claims and the discussion above traversing the rejections under 
35 U.S.C. §§ 101 and 112, Applicants respectfully submit that the specification 
discloses a credible, specific and substantial utility for the claimed invention as useful 
diagnostic tools for detection of lung and colorectal tumors. Accordingly, the 
Examiner is respectfully requested to reconsider and withdraw the rejections of the 
claims under 35 U.S.C. § 1 1 2, first paragraph. 

In light of the above amendments and remarks, Applicants believe that this 
application is now in condition for immediate allowance and respectfully request that 
the outstanding rejections be withdrawn and this case passed to issue. 
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The Examiner is invited to contact the undersigned at (650) 225-4563 if any 
issues may be resolved in that manner. 



Date: September 2, 2003 



lDNAWay,MS49 

South San Francisco, CA 94080 

Phone:(650)2254563 

Fax:(650)952-9881 



Respectfully submitted, 
GENENTECH. INC. 




By:_ 

Elizabeth M. $5rnes, Ph.D 
Reg. No. 35,059 



NTECH. INC. / 
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FIGURE 1 

CAGTTTCTTCATCTGTAACATCAAATGAATAATAATACCAATCTCCTAGACTTCATAAGA 
GGATTAACAAAGACAAAATATGGGAAAAACATAACATGGCGTCCCATAATTATTAGATCT 
TATTATTGACACTAAAATGGCATTAAAATTACCAAAAGGAAGACAGCATCTGTTTCCTCT 
TTGGTCCTGAGCTGGTTAAAAGGAACACTGGTTGCCTGAACAGTCACACTTGCAACCATG 
ATGCCTAAACATTGCTTTCTAGGCTTCCTCATCAGTTTCTTCCTTACTGGTGTAGCAGGA 
ACTCAGTCAACGCATGAGTCTCTGAAGCCTCAGAGGGTACAATTTCAGTCCCGAAATTTT 
CACAACATTTTGCAATGGCAGCCTGGGAGGGCACTTACTGGCAACAGCAGTGTCTATTTT 
GTGCAGTACAAAATCATGTTCTCATGCAGCATGAAAAGCTCTCACCAGAAGCCAAGTGGA 
TGCTGGCAGCACATTTCTTGTAACTTCCCAGGCTGCAGAACATTGGCTAAATATGGACAG 
AGACAATGGAAAAATAAAGAAGACTGTTGGGGTACTCAAGAACTCTCTTGTGACCTTACC 
AGTGAAACCTCAGACATACAGGAACCTTATTACGGGAGGGTGAGGGCGGCCTCGGCTGGG 
AGCTACTCAGAATGGAGCATGACGCCGCGGTTCACTCCCTGGTGGGAAACAAAAATAGAT 
CCTCCAGTCATGAATATAACCCAAGTCAATGGCTCTTTGTTGGTAATTCTCCATGCTCCA 
AATTTACCATATAGATACCAAAAGGAAAAAAATGTATCTATAGAAGATTACTATGAACTA 
CTATACCGAGTTTTTATAATTAACAATTCACTAGAAAAGGAGCAAAAGGTTTATGAAGGG 
GCTCACAGAGCGGTTGAAATTGAAGCTCTAACACCACACTCCAGCTACTGTGTAGTGGCT 
GAAATATATCAGCCCATGTTAGACAGAAGAAGTCAGAGAAGTGAAGAGAGATGTGTGGAA 
ATTCC ATGA CTTGTGGAATTTGGCATTCAGCAATGTGGAAATTCTAAAGCTCCCTGAGAA 
CAGGATGACTCGTGTTTGAAGGATCTTATTTAAAATTGTTTTTGTATTTTCTTAAAGCAA 
TATTCACTGTTACACCTTGGGGACTTCTTTGTTTATCCATTCTTTTATCCTTTATATTTC 
ATTTGTAAACTATATTTGAACGACATTCCCCCCGAAAAATTGAAATGTAAAGATGAGGCA 
GAGAATAAAGTGTTCTATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 2 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA14 5887 
Xsubunit 1 of 1, 262 aa, 1 stop 
XMW: 30419, pi: 8.44, NX(S/T) : 5 

MPKHCFLGFLISFFLTGVAGTQSTHESLKPQRVQFQSRNFHNILQWQPGRALTGNSSVYF 
VQYKIMFSCSMKSSHQKPSGCWQHISCNFPGCRTLAKYGQRQWKNKEDCWGTQELSCDLT 
SETSDIQEPYYGRVRAASAGSYSEWSMTPRFTPWWETKIDPPVMNITQVNGSLLVILHAP 
NLPYRYQKEKNVSIEDYYELLYRVFIINNSLEKEQKVYEGAHRAVEIEALTPHSSYCWA 

EIYQPMLDRRSQRSEERCVEIP 



Important features of the protein: 
Signal peptide: 

Amino Acids 1-20 



N-glycosylation sites: 

Amino acids 55-59; 165-169; 170-174; 191-195;208-212 



N-myristoylation sites: 

Amino acids 17-23;20-26;220-226 
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FIGURED 

TGAAATGACTTCCACGGCTGGGACGGGAACCTTCCACCCACAGCTATGCCTCTGATTGGTGA 
ATGGTGAAGGTGCCTGTCTAACTTTTCTGTAAAAAGAACCAGCTGCCTCCAGGCAGCCAGCC 
CTCAAGCATCACTTACAGGACCAGAGGGACAAGACATGACTGTGATGAGGAGCTGCTTTCGC 
CAATTTAACACCAAGAAGAATTGAGGCTGCTTGGGAGGAAGGCCAGGAGGAACACGAGACTG 
AGAGATGAATTTTCAACAGAGGCTGCAAAGCCTGTGGACTTTAGCCAGACCCTTCTGCCCTC 
CTTTGCTGGCGACAGCCTCTCAAATGCAGATGGTTGTGCTCCCTTGCCTGGGTTTTACCCTG 
CTTCTCTGGAGCCAGGTATCAGGGGCCCAGGGCCAAGAATTCCACTTTGGGCCCTGCCAAGT 
GAAGGGGGTTGTTCCCCAGAAACTGTGGGAAGCCTTCTGGGCTGTGAAAGACACTATGCAAG 
CTCAGGATAACATCACGAGTGCCCGGCTGCTGCAGCAGGAGGTTCTGCAGAACGTCTCGGAT 
GCTGAGAGCTGTTACCTTGTCCACACCCTGCTGGAGTTCTACTTGAAAACTGTTTTCAAAAA 
CCACGACAATAGAACAGTTGAAGTCAGGACTCTGAAGTCATTCTCTACTCTGGCCAACAACT 
TTGTTCTCATCGTGTCACAACTGCAACCCAGTCAAGAAAATGAGATGTTTTCCATCAGAGAC 
AGTGCACACAGGCGGTTTCTGCTATTCCGGAGAGCATTCAAACAGTTGGACGTAGAAGCAGC 
TCTGACCAAAGCCCTTGGGGAAGTGGACATTCTTCTGACCTGGATGCAGAAATTCTACAAGC 
TCTGAATGTCTAGACCAGGACCTCCCTCCCCCTGGCACTGGTTTGTTCCCTGTGTCATTTCA 
AACAGTCTCCCTTCCTATGCTGTTCACTGGACACTTCACGCCCTTGGCCATGGGTCCCATTC 
TTGGCCCAGGATTATTGTCAAAGAAGTCATTCTTTAAGCAGCGCCAGTGACAGTCAGGGAAG 
GTGCCTCTGGATGCTGTGAAGAGTCTACAGAGAAGATTCTTGTATTTATTACAACTCTATTT 
AATTAATGTCAGTATTTCAACTGAAGTTCTATTTATTTGTGAGACTGTAAGTTACATGAAGG 
CAGCAGAATATTGTGCCCCATGCTTCTTTACCCCTCACAATCCTTGCCACAGTGTGGGGCAG 
TGGATGGGTGCTTAGTAAGTACTTAATAAACTGTGGTGCTTTTTTTGGCCTGTCTTTGGATT 
GTTAAAAAACAGAGAGGGATGCTTGGATGTAAAACTGAACTTCAGAGCATGAAAATCACACT 
GTCTTCTGATATCTGCAGGGACAGAGCATTGGGGTGGGGGTAAGGTGCATCTGTTTGAAAAG 
TAAACGATAAAATGTGGATTAAAGTGCCCAGCACAAAGCAGATCCTCAATAAACATTTCATT 
TCCCACCCACACTCGCCAGCTCACCCCATCATCCCTTTCCCTTGGTGCCCTCCTTTTTTTTT 
TATCCTAGTCATTCTTCCCTAATCTTCCACTTGAGTGTCAAGCTGACCTTGCTGATGGTGAC 
ATTGCACCTGGATGTACTATCCAATCTGTGATGACATTCCCTGCTAATAAAAGACAACATAA 
CTCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 4 

></usr/seqdb2/sst/DNA/Dnaseqs.full/ss,DNA88002 
xsubunit 1 of 1, 206 aa, 1 stop 
><MW: 23799, pi: 9.12, NX(S/T): 3 

MNFQQRLQSLWTLARPrcPPLIATASQMQMVVLPCLGFTLLLWSQVSGAQGQEFHFGPCQVK 
G WPQKLWEAFWAVKDTMQAQDN I T S ARLLQQE VLQNVS DAESC YLVHTLLEFYLKT V FKNH 
HNRTVEVRTLKSFSTLANNFVLIVSQLQPSQENEMFSIRDSAHRRFLLFRRAFKQLDVEAAL 
TKALGEVDILLTWMQKFYKL 

Signal sequence: 
amino acids 1-42 

N-glycosylation sites* 

amino acids 85-89, 99-103, 126-130 
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CYTOKINE RECEPTOR AND NUCLEIC regulated in part by phosphorylation of various cellular 

ACIDS ENCODING THE SAME proteins. Protein tyrosine kinases, enzymes that catalyze that 

process, can also act as growth factor receptors. Examples 

This application claims priority under 35 U.S.C. § 119(e) include fibroblast growth factor receptor and nerve growth 

of U.S. provisional Application Serial No. 60/191,015 filed 5 factor receptor. 

on Mar. 21, 2000, and claims priority under 35 U.S.C § 119 Membrane-bound proteins and receptor molecules have 

(a)-(d) of PCT Application Nos. PCT/US00/08439 filed various industrial applications, including as pharmaceutical 

Mar. 30, 2000 and PCT/US01/06520 filed Feb. 28, 2001, and and diagnostic agents. Receptor immunoadhesins, for 

is a continuation-in-part, claiming priority under 35 U.S.C. instance, can be employed as therapeutic agents to block 

§ 120, of U.S. application Ser. No. 09/941,992 filed Aug. 28, 10 receptor-ligand interactions. The membrane-bound proteins 

2001 the entire disclosures of which are hereby expressly can also be employed for screening of potential peptide or 

incorporated by reference. small molecule inhibitors of the relevant receptor/ligand 

interaction. 

FIELD OF THE INVENTION Efforts are bdng undeitaken by bolh mduslry aca- 

The present invention relates generally to the identifica- 15 demia to identify new, native receptor or membrane-bound 

tion and isolation of novel DNA and to the recombinant proteins. Many efforts are focused on the screening of 

production of novel cytokine receptors, designated herein as mammalian recombinant DNA libraries to identify the cod- 

"PR019598" polypeptides. ing sequences for novel receptor or membrane-bound pro- 
teins. 

BACKGROUND OF THE INVENTION 20 "Decoy" receptors are well documented in the art. A 

Extracellular proteins play important roles in, among dec °y receptor is generally defined as a receptor that does 

other things, the formation, differentiation and maintenance not transduce an intracellular signal due to lack of a required 

of multicellular organisms. The fate of many individual domain. The decoy receptor may be membrane bound or it 

cells, e.g., proliferation, migration, differentiation, or inter- 25 ma Y a soluble secreted form. Decoy receptors represent 

action with other cells, is typically governed by information a uni <l ue mechanism for regulating the sensitivity of cells to 

received from other cells and/or the immediate environment. a particular ligand, as increases in the amount of decoy 

This information is often transmitted by secreted polypep- receptor results in a decrease in the amount of ligand that is 

tides (for instance, mitogenic factors, survival factors, cyto- available to bind to functional receptors. This titration of 

toxic factors, differentiation factors, neuropeptides, and 30 u g and b Y a dec °y receptor may modulate the cellular 

hormones) which are, in turn, received and interpreted by response to the ligand, for example resulting m decreased or 

diverse cell receptors or membrane-bound proteins. These increased inflammation, differentiation, proliferation or apo- 

secreted polypeptides or signaling molecules normally pass ptosis. 

through the cellular secretory pathway to reach their site of We herein describe the identification and characterization 

action in the extracellular environment. 35 of novel cytokine receptors, designated herein as PR019598 

Secreted proteins have various industrial applications, polypeptides, 

including as pharmaceuticals, diagnostics, biosensors and SUMMARY OF THE INVENTION 

bioreactors. Most protein drugs available at present, such as ^ ^ 

thrombolytic agents, interferons, interleukins, A cDNA clone (designated herein as DNA1 45887-2849) 

erythropoietins, colony stimulating factors, and various 40 has been identified that encodes a cytokine receptor desig- 

other cytokines, are secretory proteins. Their receptors, nated in the present application as" PR019598". PR0195 98 

which are membrane proteins, also have potential as thera- is a secreted cytokine receptor (otherwise known as a Decoy 

peutic or diagnostic agents. Efforts are being undertaken by receptor) which binds to a cytokine in the IL-10 family 

both industry and academia to identify new, native secreted which is designated herein as a PRO3301 polypeptide ligand 

proteins. Many efforts are focused on the screening of 45 (shown in FIG. 4; SEQ ID NO:7). The PR03301 polypep- 

mammalian recombinant DNA libraries to identify the cod- tide ligand is encoded by a nucleic acid molecule designated 

ing sequences for novel secreted proteins. Examples of herein as DNA88002 (shown in FIG. 3; SEQ ID NO:6). 

screening methods and techniques are described in the In one embodiment, the invention provides an isolated 

literature [see, for example, Klein et al., Proc. Natl Acad. nucleic acid molecule comprising a nucleotide sequence that 

ScL, 93:7108-7113 (1996); U.S. Pat. No. 5,536,637)]. 50 encodes a PR019598 polypeptide. 

Membrane-bound proteins and receptors can play impor- In one aspect, the isolated nucleic acid molecule com- 

tant roles in, among other things, the formation, differen- prises a nucleotide sequence having at least about 80% 

tiation and maintenance of multicellular organisms. The fate nucleic acid sequence identity, alternatively at least about 

of many individual cells, e.g., proliferation, migration, 81% nucleic acid sequence identity, alternatively at least 

differentiation, or interaction with other cells, is typically 55 about 82% nucleic acid sequence identity, alternatively at 

governed by information received from other cells and/or least about 83% nucleic acid sequence identity, alternatively 

the immediate environment. This information is often trans- at least about 84% nucleic acid sequence identity, alterna- 

mitted by secreted polypeptides (for instance, mitogenic lively at least about 85% nucleic acid sequence identity, 

factors, survival factors, cytotoxic factors, differentiation alternatively at least about 86% nucleic acid sequence 

factors, neuropeptides, and hormones) which are, in turn, 60 identity, alternatively at least about 87% nucleic acid 

received and interpreted by diverse cell receptors or sequence identity, alternatively at least about 88% nucleic 

membrane-bound proteins. Such membrane-bound proteins acid sequence identity, alternatively at least about 89% 

and cell receptors include, but are not limited to, cytokine nucleic acid sequence identity, alternatively at least about 

receptors, receptor kinases, receptor phosphatases, receptors 90% nucleic acid sequence identity, alternatively at least 

involved in cell<ell interactions, and cellular adhesin mol- 65 about 91% nucleic acid sequence identity, alternatively at 

ecules like selectins and integrins. For instance, transduction least about 92% nucleic acid sequence identity, alternatively 

of signals that regulate cell growth and differentiation is at least about 93% nucleic acid sequence identity, alterna- 
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tively at least about 94% nucleic acid sequence identity, about 95% nucleic acid sequence identity, alternatively at 

alternatively at least about 95% nucleic acid sequence least about 96% nucleic acid sequence identity, alternatively 

identity, alternatively at least about 96% nucleic acid at least about 97% nucleic acid sequence identity, alterna- 

sequence identity, alternatively at least about 97% nucleic tively at least about 98% nucleic acid sequence identity and 

acid sequence identity, alternatively at least about 98% 5 alternatively at least about 99% nucleic acid sequence 

nucleic acid sequence identity and alternatively at least identity to (a) a DNA molecule that encodes the same mature 

about 99% nucleic acid sequence identity to (a) a DNA polypeptide encoded by any of the human protein cDNA 

molecule encoding a PR019598 polypeptide having the deposited with the ATCC on Mar. 21, 2000 under ATCC 

sequence of amino acid residues from 1 or 21 to about 262, Deposit No. PTA-1532 (DNA1 45887-2849), or (b) the 

inclusive, of FIG. 2 (SEQ ID NO:2), or (b) the complement 3Q complement of the DNA molecule of (a). In a preferred 

of the nucleotide sequence of (a). embodiment, the isolated nucleic acid molecule comprises 

In another aspect, the isolated nucleic acid molecule (a) a nucleotide sequence encoding the same mature 

comprises (a) a nucleotide sequence encoding a PR019598 polypeptide encoded by the human protein cDNA deposited 

polypeptide having the sequence of amino acid residues with the ATCC on Mar. 21, 2000 under ATCC Deposit No. 

from 1 or 21 to about 262, inclusive, of FIG. 2 (SEQ ID 15 PTA-1532 (DNA145887-2849), or (b) the complement of 

NO:2), or (b) the complement of the nucleotide sequence of the DNA molecule of (a). 

(a) . In a another aspect, the invention concerns an isolated 
In other aspects, the isolated nucleic acid molecule com- nucleic acid molecule comprising a nucleotide sequence 

prises a nucleotide sequence having at least about 80% having at least about 80% nucleic acid sequence identity, 

nucleic acid sequence identity, alternatively at least about 2 o alternatively at least about 81% nucleic acid sequence 

81% nucleic acid sequence identity, alternatively at least identity, alternatively at least about 82% nucleic acid 

about 82% nucleic acid sequence identity, alternatively at sequence identity, alternatively at least about 83% nucleic 

least about 83% nucleic acid sequence identity, alternatively acid sequence identity, alternatively at least about 84% 

at least about 84% nucleic acid sequence identity, alterna- nucleic acid sequence identity, alternatively at least about 

tively at least about 85% nucleic acid sequence identity, 2 s nucleic acid sequence identity, alternatively at least 

alternatively at least about 86% nucleic acid sequence about 86% nucleic acid sequence identity, alternatively at 

identity, alternatively at least about 87% nucleic acid least about 87% nucleic acid sequence identity, alternatively 

sequence identity, alternatively at least about 88% nucleic at least about 88% nucleic acid sequence identity, altema- 

acid sequence identity, alternatively at least about 89% tively at least about 89% nucleic acid sequence identity, 

nucleic acid sequence identity, alternatively at least about 30 alternatively at least about 90% nucleic acid sequence 

90% nucleic acid sequence identity, alternatively at least identity, alternatively at least about 91% nucleic acid 

about 91% nucleic acid sequence identity, alternatively at sequence identity, alternatively at least about 92% nucleic 

least about 92% nucleic acid sequence identity, alternatively acid sequence identity, alternatively at least about 93% 

at least about 93% nucleic acid sequence identity, alterna- nucleic acid sequence identity, alternatively at least about 

tively at least about 94% nucleic acid sequence identity, 35 94% nucleic acid sequence identity, alternatively at least 

alternatively at least about 95% nucleic acid sequence about 95% nucleic acid sequence identity, alternatively at 

identity, alternatively at least about 96% nucleic acid least about 96% nucleic acid sequence identity, alternatively 

sequence identity, alternatively at least about 97% nucleic at least about 97% nucleic acid sequence identity, alterna- 

acid sequence identity, alternatively at least about 98% tively at least about 98% nucleic acid sequence identity and 

nucleic acid sequence identity and alternatively at least 40 alternatively at least about 99% nucleic acid sequence 

about 99% nucleic acid sequence identity to (a) a DNA identity to (a) the full-length polypeptide coding sequence of 

molecule having the sequence of nucleotides from 241 or the human protein cDNA deposited with the ATCC on Mar. 

301 to about 1026, inclusive, of FIG. 1 (SEQ ID NO:l), or 21, 2000 under ATCC Deposit No. PTA-1532 (DNA145887- 

(b) the complement of the nucleotide sequence of (a). 2849), or (b) the complement of the DNA molecule of (a). 
In another aspect, the isolated nucleic acid molecule 45 1° a preferred embodiment, the isolated nucleic acid mol- 

comprises (a) the nucleotide sequence from 241 or 301 to ecule comprises (a) the full-length coding sequence of the 

about 1026, inclusive, of FIG. 1 (SEQ ID NO:l), or (b) the human protein cDNA deposited with the ATCC on Mar. 21, 

complement of the nucleotide sequence of (a). 2000 under AT CC Deposit No. PTA-1532 (DNA145887- 

In a further aspect, the invention concerns an isolated 7S49) 9 or (b) the complement of the DNA molecule of (a), 

nucleic acid molecule comprising a nucleotide sequence 50 In another aspect, the invention concerns an isolated 

having at least about 80% nucleic acid sequence identity, nucleic acid molecule which encodes an active PR019598 

alternatively at least about 81% nucleic acid sequence polypeptide as defined below comprising a nucleotide 

identity, alternatively at least about 82% nucleic acid sequence that hybridizes to the complement of a nucleic acid 

sequence identity, alternatively at least about 83% nucleic sequence that encodes amino acids 1 or 21 to about 262, 

acid sequence identity, alternatively at least about 84% 55 inclusive, of FIG. 2 (SEQ ID NO:2). Preferably, hybridiza- 

nucleic acid sequence identity, alternatively at least about ^on occurs under stringent hybridization and wash condi- 

85% nucleic acid sequence identity, alternatively at least tions. 

about 86% nucleic acid sequence identity, alternatively at In yet another aspect, the invention concerns an isolated 

least about 87% nucleic acid sequence identity, alternatively nucleic acid molecule which encodes an active PRO 195 98 

at least about 88% nucleic acid sequence identity, alterna- 60 polypeptide as defined below comprising a nucleotide 

tively at least about 89% nucleic acid sequence identity, sequence that hybridizes to the complement of the nucleic 

alternatively at least about 90% nucleic acid sequence acid sequence between nucleotides 241 or 301 and about 

identity, alternatively at least about 91% nucleic acid 1026, inclusive, of FIG. 1 (SEQ ID NO:l). Preferably, 

sequence identity, alternatively at least about 92% nucleic hybridization occurs under stringent hybridization and wash 

acid sequence identity, alternatively at least about 93% 65 conditions. 

nucleic acid sequence identity, alternatively at least about In a further aspect, the invention concerns an isolated 

94% nucleic acid sequence identity, alternatively at least nucleic acid molecule having at least about 406 nucleotides 
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and which is produced by hybridizing a test DNA molecule 
under stringent conditions with (a) a DNA molecule encod- 
ing a PRO 195 98 polypeptide having the sequence of amino 
acid residues from 1 or 21 to about 262, inclusive, of FIG. 
2 (SEQ ID NO:2), or (b) the complement of the DNA 
molecule of (a), and, if the test DNA molecule has at least 
about an 80% nucleic acid sequence identity, alternatively at 
least about 81% nucleic acid sequence identity, alternatively 
at least about 82% nucleic acid sequence identity, alterna- 
tively at least about 83% nucleic acid sequence identity, 
alternatively at least about 84% nucleic acid sequence 
identity, alternatively at least about 85% nucleic acid 
sequence identity, alternatively at least about 86% nucleic 
acid sequence identity, alternatively at least about 87% 
nucleic acid sequence identity, alternatively at least about 
88% nucleic acid sequence identity, alternatively at least 
about 89% nucleic acid sequence identity, alternatively at 
least about 90% nucleic acid sequence identity, alternatively 
at least about 91% nucleic acid sequence identity, alterna- 
tively at least about 92% nucleic acid sequence identity, 
alternatively at least about 93% nucleic acid sequence 
identity, alternatively at least about 94% nucleic acid 
sequence identity, alternatively at least about 95% nucleic 
acid sequence identity, alternatively at least about 96% 
nucleic acid sequence identity, alternatively at least about 
97% nucleic acid sequence identity, alternatively at least 
about 98% nucleic acid sequence identity and alternatively 
at least about 99% nucleic acid sequence identity to (a) or 
(b), and isolating the test DNA molecule. 

In another aspect, the invention concerns an isolated 
nucleic acid molecule comprising (a) a nucleotide sequence 
encoding a polypeptide scoring at least about 80% positives, 
alternatively at least about 81% positives, alternatively at 
least about 82% positives, alternatively at least about 83% 
positives, alternatively at least about 84% positives, alter- 
natively at least about 85% positives, alternatively at least 
about 86% positives, alternatively at least about 87% 
positives, alternatively at least about 88% positives, alter- 
natively at least about 89% positives, alternatively at least 
about 90% positives, alternatively at least about 91% 
positives, alternatively at least about 92% positives, alter- 
natively at least about 93% positives, alternatively at least 
about 94% positives, alternatively at least about 95% 
positives, alternatively at least about 96% positives, alter- 
natively at least about 97% positives, alternatively at least 
about 98% positives and alternatively at least about 99% 
positives when compared with the amino acid sequence of 
residues 1 or 21 to 262, inclusive, of FIG. 2 (SEQ ID NO:2), 
or (b) the complement of the nucleotide sequence of (a). 

In a specific aspect, the invention provides an isolated 
nucleic acid molecule comprising DNA encoding a 
PRO 19598 polypeptide without the N-terminal signal 
sequence and/or the initiating methionine, or is complemen- 
tary to such encoding nucleic acid molecule. The signal 
peptide has been tentatively identified as extending from 
amino acid position 1 to amino acid position 20 in the 
sequence of FIG. 2 (SEQ ID NO:2). It is noted, however, 
that the C-terminal boundary of the signal peptide may vary, 
but most likely by no more than about 5 amino acids on 
either side of the signal peptide C-terminal boundary as 
initially identified herein, wherein the C-terminal boundary 
of the signal peptide may be identified pursuant to criteria 
routinely employed in the art for identifying that type of 
amino acid sequence element (e.g., Nielsen et al.,Prot. Eng. 
10:1-6 (1997) and von Heinje et al, NucL Acids. Res. 
14:4683-4690 (1986)). Moreover, it is also recognized that, 
in some cases, cleavage of a signal sequence from a secreted 
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polypeptide is not entirely uniform, resulting in more than 
one secreted species. These polypeptides, and the polynucle- 
otides encoding them, are contemplated by the present 
invention. As such, for purposes of the present application, 
the signal peptide of the PRO 19598 polypeptide shown in 
FIG. 2 (SEQ ID NO:2) extends from amino acids 1 to X of 
FIG. 2 (SEQ ID NO:2), wherein X is any amino acid from 
16 to 25 of FIG. 2 (SEQ ID NO:2). Therefore, mature forms 
of the PR019598 polypeptide which are encompassed by 
the present invention include those comprising amino acids 
X to 262 of FIG. 2 (SEQ ID NO:2), wherein X is any amino 
acid from 16 to 25 of FIG. 2 (SEQ ID NO:2) and variants 
thereof as described below. Isolated nucleic acid molecules 
encoding these polypeptides are also contemplated. 

Another embodiment is directed to fragments of a 
PRO 19598 polypeptide coding sequence that may find use 
as, for example, hybridization probes or for encoding frag- 
ments of a PRO 19598 polypeptide that may optionally 
encode a polypeptide comprising a binding site for an 
anti-PR019598 antibody. Such nucleic acid fragments are 
usually at least about 20 nucleotides in length, alternatively 
at least about 30 nucleotides in length, alternatively at least 
about 40 nucleotides in length, alternatively at least about 50 
nucleotides in length, alternatively at least about 60 nucle- 
otides in length, alternatively at least about 70 nucleotides in 
length, alternatively at least about 80 nucleotides in length, 
alternatively at least about 90 nucleotides in length, alter- 
natively at least about 100 nucleotides in length, alterna- 
tively at least about 110 nucleotides in length, alternatively 
at least about 120 nucleotides in length, alternatively at least 
about 130 nucleotides in length, alternatively at least about 
140 nucleotides in length, alternatively at least about 150 
nucleotides in length, alternatively at least about 160 nucle- 
otides in length, alternatively at least about 170 nucleotides 
in length, alternatively at least about 180 nucleotides in 
length, alternatively at least about 190 nucleotides in length, 
alternatively at least about 200 nucleotides in length, alter- 
natively at least about 250 nucleotides in length, alterna- 
tively at least about 300 nucleotides in length, alternatively 
at least about 350 nucleotides in length, alternatively at least 
about 400 nucleotides in length, alternatively at least about 
450 nucleotides in length, alternatively at least about 500 
nucleotides in length, alternatively at least about 600 nucle- 
otides in length, alternatively at least about 700 nucleotides 
in length, alternatively at least about 800 nucleotides in 
length, alternatively at least about 900 nucleotides in length 
and alternatively at least about 1000 nucleotides in length, 
wherein in this context the term "about" means the refer- 
enced nucleotide sequence length plus or minus 10% of that 
referenced length. In a preferred embodiment, the nucleotide 
sequence fragment is derived from any coding region of the 
nucleotide sequence shown in FIG. 1 (SEQ ID NO:l). It is 
noted that novel fragments of a PRO 19598 polypeptide- 
encoding nucleotide sequence may be determined in a 
routine manner by aligning the PR019598 polypeptide- 
encoding nucleotide sequence with other known nucleotide 
sequences using any of a number of well known sequence 
alignment programs and determining which PRO 19598 
polypeptide-encoding nucleotide sequence fragment(s) are 
novel. All of such PRO 195 98 polypeptide-encoding nucle- 
otide sequences are contemplated herein and can be deter- 
mined without undue experimentation. Also contemplated 
are the PRO 19598 polypeptide fragments encoded by these 
nucleotide molecule fragments, preferably those PR019598 
polypeptide fragments that comprise a binding site for an 
anti-PR019598 antibody. 

In an other embodiment, the invention provides a vector 
comprising a nucleotide sequence encoding PRO 19598 or 
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its variants. The vector may comprise any of the isolated 
nucleic acid molecules hereinabove identified. 

A host cell comprising such a vector is also provided. By 
way of example, the host cells may be CHO cells, E. coli, 
yeast, or Baculovirus-infected insect cells. A process for 5 
producing PR019598 polypeptides is further provided and 
comprises culturing host cells under conditions suitable for 
expression of PR019598 and recovering PR019598 from 
the cell culture. 

In another embodiment, the invention provides isolated 10 
PR019598 polypeptide encoded by any of the isolated 
nucleic acid sequences hereinabove identified. 

In a specific aspect, the invention provides isolated native 
sequence PR019598 polypeptide, which in certain 15 
embodiments, includes an amino acid sequence comprising 
residues from 1 or 21 to about 262 of FIG. 2 (SEQ ID NO:2). 

In another aspect, the invention concerns an isolated 
PRO 19598 polypeptide, comprising an amino acid sequence 
having at least about 80% amino acid sequence identity, 2 o 
alternatively at least about 81% amino acid sequence 
identity, alternatively at least about 82% amino acid 
sequence identity, alternatively at least about 83% amino 
acid sequence identity, alternatively at least about 84% 
amino acid sequence identity, alternatively at least about 2 s 
85% amino acid sequence identity, alternatively at least 
about 86% amino acid sequence identity, alternatively at 
least about 87% amino acid sequence identity, alternatively 
at least about 88% amino acid sequence identity, alterna- 
tively at least about 89% amino acid sequence identity, 30 
alternatively at least about 90% amino acid sequence 
identity, alternatively at least about 91% amino acid 
sequence identity, alternatively at least about 92% amino 
acid sequence identity, alternatively at least about 93% 
amino acid sequence identity, alternatively at least about 35 
94% amino acid sequence identity, alternatively at least 
about 95% amino acid sequence identity, alternatively at 
least about 96% amino acid sequence identity, alternatively 
at least about 97% amino acid sequence identity, alterna- 
tively at least about 98% amino acid sequence identity and 40 
alternatively at least about 99% amino acid sequence iden- 
tity to the sequence of amino acid residues from 1 or 21 to 
about 262, inclusive, of FIG. 2 (SEQ ID NO:2). 

In a further aspect, the invention concerns an isolated 
PR019598 polypeptide comprising an amino acid sequence 45 
having at least about 80% amino acid sequence identity, 
alternatively at least about 81% amino acid sequence 
identity, alternatively at least about 82% amino acid 
sequence identity, alternatively at least about 83% amino 
acid sequence identity, alternatively at least about 84% 50 
amino acid sequence identity, alternatively at least about 
85% amino acid sequence identity, alternatively at least 
about 86% amino acid sequence identity, alternatively at 
least about 87% amino acid sequence identity, alternatively 
at least about 88% amino acid sequence identity, alterna- 55 
tively at least about 89% amino acid sequence identity, 
alternatively at least about 90% amino acid sequence 
identity, alternatively at least about 91% amino acid 
sequence identity, alternatively at least about 92% amino 
acid sequence identity, alternatively at least about 93% 60 
amino acid sequence identity, alternatively at least about 
94% amino acid sequence identity, alternatively at least 
about 95% amino acid sequence identity, alternatively at 
least about 96% amino acid sequence identity, alternatively 
at least about 97% amino acid sequence identity, alterna- 65 
tively at least about 98% amino acid sequence identity and 
alternatively at least about 99% amino acid sequence iden- 



tity to an amino acid sequence encoded by the human protein 
cDNA deposited with the ATCC on Mar. 21, 2000 under 
ATCC Deposit No. PTA-1532 (DNA1 45887-2849). In a 
preferred embodiment, the isolated PRO 19598 polypeptide 
comprises an amino acid sequence encoded by the human 
protein cDNA deposited with the ATCC on Mar. 21, 2000 
under ATCC Deposit No. PTA-1532 (DNA145887-2849). 

In a further aspect, the invention concerns an isolated 
PR019598 polypeptide comprising an amino acid sequence 
scoring at least about 80% positives, alternatively at least 
about 81% positives, alternatively at least about 82% 
positives, alternatively at least about 83% positives, alter- 
natively at least about 84% positives, alternatively at least 
about 85% positives, alternatively at least about 86% 
positives, alternatively at least about 87% positives, alter- 
natively at least about 88% positives, alternatively at least 
about 89% positives, alternatively at least about 90% 
positives, alternatively at least about 91% positives, alter- 
natively at least about 92% positives, alternatively at least 
about 93% positives, alternatively at least about 94% 
positives, alternatively at least about 95% positives, alter- 
natively at least about 96% positives, alternatively at least 
about 97% positives, alternatively at least about 98% posi- 
tives and alternatively at least about 99% positives when 
compared with the amino acid sequence of residues from 1 
or 21 to about 262, inclusive, of FIG. 2 (SEQ ID NO:2). 

In a specific aspect, the invention provides an isolated 
PR019598 polypeptide without the N-terminal signal 
sequence and/or the initiating methionine and is encoded by 
a nucleotide sequence that encodes such an amino acid 
sequence as hereinbefore described. Processes for producing 
the same are also herein described, wherein those processes 
comprise culturing a host cell comprising a vector which 
comprises the appropriate encoding nucleic acid molecule 
under conditions suitable for expression of the PR019598 
polypeptide and recovering the PR019598 polypeptide from 
the cell culture. 

In yet another aspect, the invention concerns an isolated 
PR019598 polypeptide, comprising the sequence of amino 
acid residues from 1 or 21 to about 262, inclusive, of FIG. 
2 (SEQ ID NO:2), or a fragment thereof which is biologi- 
cally active or sufficient to provide a binding site for an 
anti-PR019598 antibody, wherein the identification of 
PRO 19598 polypeptide fragments that possess biological 
activity or provide a binding site for an anti-PR019598 
antibody may be accomplished in a routine manner using 
techniques which are well known in the art. Preferably, the 
PR019598 fragment retains a qualitative biological activity 
of a native PR019598 polypeptide. 

In a still further aspect, the invention provides a polypep- 
tide produced by (i) hybridizing a test DNA molecule under 
stringent conditions with (a) a DNA molecule encoding a 
PR019598 polypeptide having the sequence of amino acid 
residues from 1 or 21 to about 262, inclusive, of FIG. 2 (SEQ 
ID NO:2), or (b) the complement of the DNA molecule of 
(a), and if the test DNA molecule has at least about an 80% 
nucleic acid sequence identity, alternatively at least about 
81% nucleic acid sequence identity, alternatively at least 
about 82% nucleic acid sequence identity, alternatively at 
least about 83% nucleic acid sequence identity, alternatively 
at least about 84% nucleic acid sequence identity, alterna- 
tively at least about 85% nucleic acid sequence identity, 
alternatively at least about 86% nucleic acid sequence 
identity, alternatively at least about 87% nucleic acid 
sequence identity, alternatively at least about 88% nucleic 
acid sequence identity, alternatively at least about 89% 
nucleic acid sequence identity, alternatively at least about 
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90% nucleic acid sequence identity, alternatively at least 
about 91% nucleic acid sequence identity, alternatively at 
least about 92% nucleic acid sequence identity, alternatively 
at least about 93% nucleic acid sequence identity, alterna- 
tively at least about 94% nucleic acid sequence identity, 
alternatively at least about 95% nucleic acid sequence 
identity, alternatively at least about 96% nucleic acid 
sequence identity, alternatively at least about 97% nucleic 
acid sequence identity, alternatively at least about 98% 
nucleic acid sequence identity and alternatively at least 
about 99% nucleic acid sequence identity to (a) or (b), (ii) 
culturing a host cell comprising the test DNA molecule 
under conditions suitable for expression of the polypeptide, 
and (iii) recovering the polypeptide from the cell culture. 

In another embodiment, the invention provides chimeric 
molecules comprising a PR019598 polypeptide fused to a 
heterologous polypeptide or amino acid sequence, wherein 
the PR019598 polypeptide may comprise any PR019598 
polypeptide, variant or fragment thereof as hereinbefore 
described. An example of such a chimeric molecule com- 
prises a PRO 19598 polypeptide fused to an epitope tag 
sequence or a Fc region of an immunoglobulin. 

In another embodiment, the invention provides an anti- 
body as defined below which specifically binds to a 
PR019598 polypeptide as hereinbefore described. 
Optionally, the antibody is a monoclonal antibody, an anti- 
body fragment or a single chain antibody. 

In yet another embodiment, the invention concerns ago- 
nists and antagonists of a native PRO 19598 polypeptide as 
defined below. In a particular embodiment, the agonist or 
antagonist is an anti-PR019598 antibody or a small mol- 
ecule. 

In a further embodiment, the invention concerns a method 
of identifying agonists or antagonists to a PRO 19598 
polypeptide which comprise contacting the PRO 19598 
polypeptide with a candidate molecule and monitoring a 
biological activity mediated by said PRO 19598 polypeptide. 
Preferably, the PR019598 polypeptide is a native 
PR019598 polypeptide. 

In a still further embodiment, the invention concerns a 
composition of matter comprising a PRO 19598 polypeptide, 
or an agonist or antagonist of a PRO 19598 polypeptide as 
herein described, or an anti-PR019598 antibody, in combi- 
nation with a carrier. Optionally, the carrier is a pharmaceu- 
tically acceptable carrier. 

Another embodiment of the present invention is directed 
to the use of a PRO 19598 polypeptide, or an agonist or 
antagonist thereof as herein described, or an anti-PR019598 
antibody, for the preparation of a medicament useful in the 
treatment of a condition which is responsive to the 
PR019598 polypeptide, an agonist or antagonist thereof or 
an anti-PR019598 antibody. 

In yet other embodiments, the invention provides oligo- 
nucleotide probes which may be useful for isolating 
genomic and cDNA nucleotide sequences, measuring or 
detecting expression of an associated gene or as antisense 
probes, wherein those probes may be derived from any of 
the above or below described nucleotide sequences. Pre- 
ferred probe lengths are described above. 

In yet other embodiments, the present invention is 
directed to methods of using the PRO 19598 polypeptides of 
the present invention for a variety of uses based upon the 
functional biological assay data presented in the Examples 
below. In a particular embodiment, a PRO 19598 polypeptide 
functions as a cytokine receptor that acts as a Decoy receptor 
and a natural antagonist of a cytokine in the IL-10 family 
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[designated herein as a PRO3301 polypeptide which is 
encoded by a nucleic acid molecule designated herein as 
DNA88002], 

5 BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 shows a nucleotide sequence (SEQ ID NO:l) of a 
native sequence PR019598 cDNA, wherein SEQ ID NO:l 
is a clone designated herein as "DNA145887-2849". 
10 FIG. 2 shows the amino acid sequence (SEQ ID NO:2) 
derived from the coding sequence of SEQ ID NO: 1 shown 
in FIG. 1. 

FIG. 3 shows a nucleotide sequence (SEQ ID NO:6) of a 
native sequence PRO3301 cDNA, wherein SEQ ID NO:3 is 
15 a clone designated herein as "DNA88002". 

FIG. 4 shows the amino acid sequence (SEQ ID NO: 7) 
derived from the coding sequence of SEQ ID NO: 6 shown 
in FIG. 3. 

20 DETAILED DESCRIPTION OF THE 

PREFERRED EMBODIMENTS 

I. Definitions 

The terms "PRO polypeptide" and "PRO" as used herein 
and when immediately followed by a numerical designation 

25 refer to various polypeptides, wherein the complete desig- 
nation (i.e., PRO/number) refers to specific polypeptide 
sequences as described herein. The terms "PRO/number 
polypeptide" and "PRO/number" wherein the term "num- 
ber" is provided as an actual numerical designation as used 

30 herein encompass native sequence polypeptides and 
polypeptide variants (which are further defined herein). The 
PRO polypeptides described herein may be isolated from a 
variety of sources, such as from human tissue types or from 
another source, or prepared by recombinant or synthetic 

35 methods. The term "PRO polypeptide" refers to each indi- 
vidual PRO/number polypeptide disclosed herein. All dis- 
closures in this specification which refer to the "PRO 
polypeptide" refer to each of the polypeptides individually 
as well as jointly. For example, descriptions of the prepa- 

40 ration of, purification of, derivation of, formation of anti- 
bodies to or against, administration of, compositions 
containing, treatment of a disease with, etc., pertain to each 
polypeptide of the invention individually. The term "PRO 
polypeptide" also includes variants of the PRO/number 

45 polypeptides disclosed herein. 

A "native sequence PRO 195 98 polypeptide" comprises a 
polypeptide having the same amino acid sequence as the 
corresponding PR019598 polypeptide derived from nature. 
Such native sequence PR019598 polypeptides can be iso- 

50 lated from nature or can be produced by recombinant or 
synthetic means. The term "native sequence 19598 polypep- 
tide" specifically encompasses naturally-occurring truncated 
or secreted forms of the specific PRO 19598 polypeptide 
(e.g., an extracellular domain sequence), naturally-occurring 

55 variant forms (e.g., alternatively spliced forms) and 
naturally -occurring allelic variants of the polypeptide. In 
various embodiments of the invention, the native sequence 
PR019598 polypeptides disclosed herein are mature or 
full-length native sequence polypeptides comprising the 

60 full-length amino acids sequences shown in the accompa- 
nying figures. Start and stop codons are shown in bold font 
and underlined in the figures. However, while the 
PRO 19598 polypeptide disclosed in the accompanying fig- 
ures are shown to begin with methionine residues designated 

65 herein as amino acid position 1 in the figures, it is conceiv- 
able and possible that other methionine residues located 
either upstream or downstream from the amino acid position 
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1 in the figures may be employed as the starting amino acid 
residue for the PR019598 polypeptides. 

The PR019598 polypeptide "extracellular domain" or 
"ECD" refers to a form of the PR019598 polypeptide which 
is essentially free of the transmembrane and cytoplasmic 
domains. Ordinarily, a PR019598 polypeptide ECD will 
have less than 1% of such transmembrane and/or cytoplas- 
mic domains and preferably, will have less than 0.5% of 
such domains. It will be understood that any transmembrane 
domains identified for the PRO 195 98 polypeptides of the 
present invention are identified pursuant to criteria routinely 
employed in the art for identifying that type of hydrophobic 
domain. The exact boundaries of a transmembrane domain 
may vary but most likely by no more than about 5 amino 
acids at either end of the domain as initially identified 
herein. Optionally, therefore, an extracellular domain of a 
PRO 19598 polypeptide may contain from about 5 or fewer 
amino acids on either side of the transmembrane domain/ 
extracellular domain boundary as identified in the Examples 
or specification and such polypeptides, with or without the 
associated signal peptide, and nucleic acid encoding them, 
are comtemplated by the present invention. 

The approximate location of the "signal peptides" of the 
various PRO 195 98 polypeptides disclosed herein are shown 
in the present specification and/or the accompanying figures. 
It is noted, however, that the C-terminal boundary of a signal 
peptide may vary, but most likely by no more than about 5 
amino acids on either side of the signal peptide C-terminal 
boundary as initially identified herein, wherein the 
C-terminal boundary of the signal peptide may be identified 
pursuant to criteria routinely employed in the art for iden- 
tifying that type of amino acid sequence element (e.g., 
Nielsen et al., Prot. Eng. 10:1-6 (1997) and von Heinje et 
al, Nucl. Acids. Res. 14:4683^690 (1986)). Moreover, it is 
also recognized that, in some cases, cleavage of a signal 
sequence from a secreted polypeptide is not entirely 
uniform, resulting in more than one secreted species. These 
mature polypeptides, where the signal peptide is cleaved 
within no more than about 5 amino acids on either side of the 
C-terminal boundary of the signal peptide as identified 
herein, and the polynucleotides encoding them, are contem- 
plated by the present invention. 

"PR019598 polypeptide variant" means an active 
PRO 19598 polypeptide as defined above or below having at 
least about 80% amino acid sequence identity with a full- 
length native sequence PRO 19598 polypeptide sequence as 
disclosed herein, a PR019598 polypeptide sequence lacking 
the signal peptide as disclosed herein, an extracellular 
domain of a PRO 195 98 polypeptide, with or without the 
signal peptide, as disclosed herein or any other fragment of 
a full-length PRO 195 98 polypeptide sequence as disclosed 
herein. Such PRO 195 98 polypeptide variants include, for 
instance, PR019598 polypeptides wherein one or more 
amino acid residues are added, or deleted, at the N- or 
C-terminus of the full-length native amino acid sequence. 
Ordinarily, a PR019598 polypeptide variant will have at 
least about 80% amino acid sequence identity, alternatively 
at least about 81% amino acid sequence identity, alterna- 
tively at least about 82% amino acid sequence identity, 
alternatively at least about 83% amino acid sequence 
identity, alternatively at least about 84% amino acid 
sequence identity, alternatively at least about 85% amino 
acid sequence identity, alternatively at least about 86% 
amino acid sequence identity, alternatively at least about 
87% amino acid sequence identity, alternatively at least 
about 88% amino acid sequence identity, alternatively at 
least about 89% amino acid sequence identity, alternatively 



at least about 90% amino acid sequence identity, alterna- 
tively at least about 91% amino acid sequence identity, 
alternatively at least about 92% amino acid sequence 
identity, alternatively at least about 93% amino acid 

5 sequence identity, alternatively at least about 94% amino 
acid sequence identity, alternatively at least about 95% 
amino acid sequence identity, alternatively at least about 
96% amino acid sequence identity, alternatively at least 
about 97% amino acid sequence identity, alternatively at 

io least about 98% amino acid sequence identity and alterna- 
tively at least about 99% amino acid sequence identity to a 
full-length native sequence PR019598 polypeptide 
sequence as disclosed herein, a PRO 19598 polypeptide 
sequence lacking the signal peptide as disclosed herein, an 

15 extracellular domain of a PRO 19598 polypeptide, with or 
without the signal peptide, as disclosed herein or any other 
specifically defined fragment of a full-length PR019598 
polypeptide sequence as disclosed herein. Ordinarily, 
PR019598 variant polypeptides are at least about 10 amino 

20 acids in length, alternatively at least about 20 amino acids in 
length, alternatively at least about 30 amino acids in length, 
alternatively at least about 40 amino acids in length, alter- 
natively at least about 50 amino acids in length, alternatively 
at least about 60 amino acids in length, alternatively at least 

25 about 70 amino acids in length, alternatively at least about 
80 amino acids in length, alternatively at least about 90 
amino acids in length, alternatively at least about 100 amino 
acids in length, alternatively at least about 150 amino acids 
in length, alternatively at least about 200 amino acids in 

30 length, alternatively at least about 300 amino acids in length, 
or more. 

"Percent (%) amino acid sequence identity" with respect 
to the PRO polypeptide sequences identified herein is 
defined as the percentage of amino acid residues in a 

35 candidate sequence that are identical with the amino acid 
residues in the specific PRO polypeptide sequence, after 
aligning the sequences and introducing gaps, if necessary, to 
achieve the maximum percent sequence identity, and not 
considering any conservative substitutions as part of the 

40 sequence identity. Alignment for purposes of determining 
percent amino acid sequence identity can be achieved in 
various ways that are within the skill in the art, for instance, 
using publicly available computer software such as BLAST, 
BLAST-2, ALIGN or Megalign (DNASTAR) software. 

45 Those skilled in the art can determine appropriate param- 
eters for measuring alignment, including any algorithms 
needed to achieve maximal alignment over the full length of 
the sequences being compared. For purposes herein, 
however, % amino acid sequence identity values are gener- 

50 ated using the sequence comparison computer program 
ALIGN-2, wherein the complete source code for the 
ALIGN-2 program is provided in Table 1 below. The 
ALIGN-2 sequence comparison computer program was 
authored by Genentech, Inc. and the source code shown in 

55 Table 1 below has been filed with user documentation in the 
U.S. Copyright Office, Washington D.C., 20559, where it is 
registered under U.S. Copyright Registration No. 
TXU5 10087. The ALIGN-2 program is publicly available 
through Genentech, Inc., South San Francisco, Calif, or may 

60 be compiled from the source code provided in Table 1 below. 
The ALIGN-2 program should be compiled for use on a 
UNIX operating system, preferably digital UNIX V4.0D. 
All sequence comparison parameters are set by the 
ALIGN-2 program and do not vary. 

65 In situations where ALIGN-2 is employed for amino acid 
sequence comparisons, the % amino acid sequence identity 
of a given amino acid sequence A to, with, or against a given 



US 6,7 

13 

amino acid sequence B (which can alternatively be phrased 
as a given amino acid sequence A that has or comprises a 
certain % amino acid sequence identity to, with, or against 
a given amino acid sequence B) is calculated as follows: 

100 times the fraction XfY 

where X is the number of amino acid residues scored as 
identical matches by the sequence alignment program 
ALIGN-2 in that program's alignment of A and B, and 
where Y is the total number of amino acid residues in B. It 
will be appreciated that where the length of amino acid 
sequence A is not equal to the length of amino acid sequence 
B, the % amino acid sequence identity of A to B will not 
equal the % amino acid sequence identity of B to A. As 
examples of % amino acid sequence identity calculations 
using this method, Tables 2 and 3 demonstrate how to 
calculate the % amino acid sequence identity of the amino 
acid sequence designated "Comparison Protein" to the 
amino acid sequence designated "PRO", wherein "PRO" 
represents the amino acid sequence of a hypothetical PRO 
polypeptide of interest, "Comparison Protein" represents the 
amino acid sequence of a polypeptide against which the 
"PRO" polypeptide of interest is being compared, and "X", 
"Y" and "Z" each represent different hypothetical amino 
acid residues. 

Unless specifically stated otherwise, all % amino acid 
sequence identity values used herein are obtained as 
described in the immediately preceding paragraph using the 
ALIGN-2 computer program. However, % amino acid 
sequence identity values may also be obtained as described 
below by using the WU-BLAST-2 computer program 
(Altschul et aL, Methods in Enzymology 266:460-480 

(1996) ). Most of the WU-BLAST-2 search parameters are 
set to the default values. Those not set to default values, i.e., 
the adjustable parameters, are set with the following values: 
overlap span=l, overlap fraction=0.125, word threshold 
(T)»ll, and scoring matrix=BLOSUM62. When 
WU-BLAST-2 is employed, a % amino acid sequence 
identity value is determined by dividing (a) the number of 
matching identical amino acid residues between the amino 
acid sequence of the PRO polypeptide of interest having a 
sequence derived from the native PRO polypeptide and the 
comparison amino acid sequence of interest (i.e., the 
sequence against which the PRO polypeptide of interest is 
being compared which may be a PRO variant polypeptide) 
as determined by WU-BLAST-2 by (b) the total number of 
amino acid residues of the PRO polypeptide of interest. For 
example, in the statement "a polypeptide comprising an the 
amino acid sequence A which has or having at least 80% 
amino acid sequence identity to the amino acid sequence B", 
the amino acid sequence A is the comparison amino acid 
sequence of interest and the amino acid sequence B is the 
amino acid sequence of the PRO polypeptide of interest. 

Percent amino acid sequence identity may also be deter- 
mined using the sequence comparison program NCBI- 
BLAST2 (Altschul et aL, Nucleic Acids Res. 25:3389-3402 

(1997) ). The NCBI-BLAST2 sequence comparison program 
may be downloaded from http://www.ncbi.nlm.nih.gov or 
otherwise obtained from the National Institute of Health, 
Bethesda, Md. NCBI-BLAST2 uses several search 
parameters, wherein all of those search parameters are set to 
default values including, for example, unmask=yes, strand* 
all, expected occurrences- 10, minimum low complexity 
length=15/5, multi-pass e-value=0.01, constant for multi- 
pass=25, dropoff for final gapped alignment=25 and scoring 
matrix=BLOSUM62. 

In situations where NCBI-BLAST2 is employed for 
amino acid sequence comparisons, the % amino acid 
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sequence identity of a given amino acid sequence A to, with, 
or against a given amino acid sequence B (which can 
alternatively be phrased as a given amino acid sequence A 
that has or comprises a certain % amino acid sequence 
5 identity to, with, or against a given amino acid sequence B) 
is calculated as follows: 

100 times the fraction XfY 

where X is the number of amino acid residues scored as 

io identical matches by the sequence alignment program 
NCBI-BLAST2 in that program's alignment of A and B, and 
where Y is the total number of amino acid residues in B. It 
will be appreciated that where the length of amino acid 
sequence A is not equal to the length of amino acid sequence 

15 B, the % amino acid sequence identity of A to B will not 
equal the % amino acid sequence identity of B to A. 

"PR019598 variant polynucleotide" or "PR019598 vari- 
ant nucleic acid sequence" means a nucleic acid molecule 
which encodes an active PRO 195 98 polypeptide as defined 

20 below and which has at least about 80% nucleic acid 
sequence identity with a nucleotide acid sequence encoding 
a full-length native sequence PR019598 polypeptide 
sequence as disclosed herein, a full-length native sequence 
PRO 19598 polypeptide sequence lacking the signal peptide 

25 as disclosed herein, an extracellular domain of a PR019598 
polypeptide, with or without the signal peptide, as disclosed 
herein or any other fragment of a full-length PRO 19598 
polypeptide sequence as disclosed herein. Ordinarily, a 
PR019598 variant polynucleotide will have at least about 

30 80% nucleic acid sequence identity, alternatively at least 
about 81% nucleic acid sequence identity, alternatively at 
least about 82% nucleic acid sequence identity, alternatively 
at least about 83% nucleic acid sequence identity, alterna- 
tively at least about 84% nucleic acid sequence identity, 

35 alternatively at least about 85% nucleic acid sequence 
identity, alternatively at least about 86% nucleic acid 
sequence identity, alternatively at least about 87% nucleic 
acid sequence identity, alternatively at least about 88% 
nucleic acid sequence identity, alternatively at least about 

40 89% nucleic acid sequence identity, alternatively at least 
about 90% nucleic acid sequence identity, alternatively at 
least about 91% nucleic acid sequence identity, alternatively 
at least about 92% nucleic acid sequence identity, alterna- 
tively at least about 93% nucleic acid sequence identity, 

45 alternatively at least about 94% nucleic acid sequence 
identity, alternatively at least about 95% nucleic acid 
sequence identity, alternatively at least about 96% nucleic 
acid sequence identity, alternatively at least about 97% 
nucleic acid sequence identity, alternatively at least about 

50 98% nucleic acid sequence identity and alternatively at least 
about 99% nucleic acid sequence identity with a nucleic acid 
sequence encoding a full-length native sequence PRO 195 98 
polypeptide sequence as disclosed herein, a full-length 
native sequence PRO 19598 polypeptide sequence lacking 

55 the signal peptide as disclosed herein, an extracellular 
domain of a PR019598 polypeptide, with or without the 
signal sequence, as disclosed herein or any other fragment of 
a full-length PRO 195 98 polypeptide sequence as disclosed 
herein. Variants do not encompass the native nucleotide 

60 sequence. 

Ordinarily, PRO 19598 variant polynucleotides are at least 
about 30 nucleotides in length, alternatively at least about 60 
nucleotides in length, alternatively at least about 90 nucle- 
otides in length, alternatively at least about 120 nucleotides 
65 in length, alternatively at least about 150 nucleotides in 
length, alternatively at least about 180 nucleotides in length, 
alternatively at least about 210 nucleotides in length, alter- 
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natively at least about 240 nucleotides in length, alterna- 
tively at least about 270 nucleotides in length, alternatively 
at least about 300 nucleotides in length, alternatively at least 
about 450 nucleotides in length, alternatively at least about 
600 nucleotides in length, alternatively at least about 900 
nucleotides in length, or more. 

"Percent (%) nucleic acid sequence identity" with respect 
to PRO-encoding nucleic acid sequences identified herein is 
defined as the percentage of nucleotides in a candidate 
sequence that are identical with the nucleotides in the PRO 
nucleic acid sequence of interest, after aligning the 
sequences and introducing gaps, if necessary, to achieve the 
maximum percent sequence identity. Alignment for pur- 
poses of determining percent nucleic acid sequence identity 
can be achieved in various ways that are within the skill in 
the art, for instance, using publicly available computer 
software such as BLAST, BLAST-2, ALIGN or Megalign 
(DNASTAR) software. For purposes herein, however, % 
nucleic acid sequence identity values are generated using the 
sequence comparison computer program ALIGN-2, wherein 
the complete source code for the ALIGN-2 program is 
provided in Table 1 below. The ALIGN-2 sequence com- 
parison computer program was authored by Genentech, Inc. 
and the source code shown in Table 1 below has been filed 
with user documentation in the U.S. Copyright Office, 
Washington D.C., 20559, where it is registered under U.S. 
Copyright Registration No. TXU510087. The ALIGN-2 
program is publicly available through Genentech, Inc., 
South San Francisco, Calif, or may be compiled from the 
source code provided in Table 1 below. The ALIGN-2 
program should be compiled for use on a UNIX operating 
system, preferably digital UNIX V4.0D. All sequence com- 
parison parameters are set by the ALIGN-2 program and do 
not vary. 

In situations where ALIGN-2 is employed for nucleic acid 
sequence comparisons, the % nucleic acid sequence identity 
of a given nucleic acid sequence C to, with, or against a 
given nucleic acid sequence D (which can alternatively be 
phrased as a given nucleic acid sequence C that has or 
comprises a certain % nucleic acid sequence identity to, 
with, or against a given nucleic acid sequence D) is calcu- 
lated as follows: 

100 times the fraction WfZ 

where W is the number of nucleotides scored as identical 
matches by the sequence alignment program ALIGN-2 in 
that program's alignment of C and D, and where Z is the 
total number of nucleotides in D. It will be appreciated that 
where the length of nucleic acid sequence C is not equal to 
the length of nucleic acid sequence D, the % nucleic acid 
sequence identity of C to D will not equal the % nucleic acid 
sequence identity of D to C. As examples of % nucleic acid 
sequence identity calculations, Tables 4 and 5, demonstrate 
how to calculate the % nucleic acid sequence identity of the 
nucleic acid sequence designated "Comparison DNA" to the 
nucleic acid sequence designated "PRO-DNA", wherein 
"PRO-DNA" represents a hypothetical PRO-encoding 
nucleic acid sequence of interest, "Comparison DNA" rep- 
resents the nucleotide sequence of a nucleic acid molecule 
against which the "PRO-DNA" nucleic acid molecule of 
interest is being compared, and "N", "L" and "V" each 
represent different hypothetical nucleotides. 

Unless specifically stated otherwise, all % nucleic acid 
sequence identity values used herein are obtained as 
described in the immediately preceding paragraph using the 
ALIGN-2 computer program. However, % nucleic acid 
sequence identity values may also be obtained as described 



below by using the WU- BLAST-2 computer program 
(Altschul et al., Methods in Enzymology 266:460-480 
(1996)). Most of the WU-BLAST-2 search parameters are 
set to the default values. Those not set to default values, i.e., 

5 the adjustable parameters, are set with the following values: 
overlap span=l, overlap firaction=0.125, word threshold 
(T)=ll, and scoring matrix=BLOSUM62. When 
WU-BLAST-2 is employed, a % nucleic acid sequence 
identity value is determined by dividing (a) the number of 

10 matching identical nucleotides between the nucleic acid 
sequence of the PRO polypeptide-encoding nucleic acid 
molecule of interest having a sequence derived from the 
native sequence PRO polypeptide-encoding nucleic acid and 
the comparison nucleic acid molecule of interest (i.e., the 

15 sequence against which the PRO polypeptide-encoding 
nucleic acid molecule of interest is being compared which 
may be a variant PRO polynucleotide) as determined by 
WU-BLAST-2 by (b) the total number of nucleotides of the 
PRO polypeptide-encoding nucleic acid molecule of inter- 

20 est. For example, in the statement "an isolated nucleic acid 
molecule comprising a nucleic acid sequence A which has or 
having at least 80% nucleic acid sequence identity to the 
nucleic acid sequence B", the nucleic acid sequence A is the 
comparison nucleic acid molecule of interest and the nucleic 

25 acid sequence B is the nucleic acid sequence of the PRO 
polypeptide-encoding nucleic acid molecule of interest. 

Percent nucleic acid sequence identity may also be deter- 
mined using the sequence comparison program NCBI- 
BLAST2 (Altschul et al., Nucleic Acids Res. 25:3389-3402 

30 (1997)). The NCBI-BLAST2 sequence comparison program 
may be downloaded from http://www.ncbi.nlm.nih.gov or 
otherwise obtained from the National Institute of Health, 
Bethesda, Md. NCBI-BLAST2 uses several search 
parameters, wherein all of those search parameters are set to 

35 default values including, for example, unmask=yes, strand« 
all, expected occurrences=10, minimum low complexity 
length=15/5, multi-pass e-value =0.01, constant for multi- 
pass=25, dropoff for final gapped alignment=25 and scoring 
matrix=BLOSUM62. 

40 In situations where NCBI-BLAST2 is employed for 
sequence comparisons, the % nucleic acid sequence identity 
of a given nucleic acid sequence C to, with, or against a 
given nucleic acid sequence D (which can alternatively be 
phrased as a given nucleic acid sequence C that has or 

45 comprises a certain % nucleic acid sequence identity to, 
with, or against a given nucleic acid sequence D) is calcu- 
lated as follows: 

100 times the fraction W/Z 

50 where W is the number of nucleotides scored as identical 
matches by the sequence alignment program NCBI- 
BLAST2 in that program's alignment of C and D, and where 
Z is the total number of nucleotides in D. It will be 
appreciated that where the length of nucleic acid sequence C 

55 is not equal to the length of nucleic acid sequence D, the % 
nucleic acid sequence identity of C to D will not equal the 
% nucleic acid sequence identity of D to C. 

In other embodiments, PR019598 variant polynucle- 
otides are nucleic acid molecules that encode an active 

60 PR019598 polypeptide and which are capable of 
hybridizing, preferably under stringent hybridization and 
wash conditions, to nucleotide sequences encoding a full- 
length PR019598 polypeptide as disclosed herein. 
PR019598 variant polypeptides may be those that are 

65 encoded by a PR019598 variant polynucleotide. 

"Isolated, " when used to describe the various polypep- 
tides disclosed herein, means polypeptide that has been 
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identified and separated and/or recovered from a component "Stringency" of hybridization reactions is readily deter- 

of its natural environment. Contaminant components of its minable by one of ordinary skill in the art, and generally is 

natural environment are materials that would typically inter- an empirical calculation dependent upon probe length, 

fere with diagnostic or therapeutic uses for the polypeptide, washing temperature, and salt concentration. In general, 

and may include enzymes, hormones, and other proteina- 5 longer probes require higher temperatures for proper 

ceous or non-proteinaceous solutes. In preferred annealing, while shorter probes need lower temperatures, 

embodiments, the polypeptide will be purified (1) to a Hybridization generally depends on the ability of denatured 

degree sufficient to obtain at least 15 residues of N-terminal DNA to reanneal when complementary strands are present in 

or internal amino acid sequence by use of a spinning cup an environment below their melting temperature. The higher 

sequenator, or (2) to homogeneity by SDS-PAGE under 10 the degree of desired homology between the probe and 

non-reducing or reducing conditions using Coomassie blue hybridizable sequence, the higher the relative temperature 

or, preferably, silver stain. Isolated polypeptide includes which can be used. As a result, it follows that higher relative 

polypeptide in situ within recombinant cells, since at least temperatures would tend to make the reaction conditions 

one component of the PRO 19598 polypeptide natural envi- more stringent, while lower temperatures less so. For addi- 

ronment will not be present. Ordinarily, however, isolated 15 tional details and explanation of stringency of hybridization 

polypeptide will be prepared by at least one purification step. reactions, see Ausubel et al., Current Protocols in Molecular 

An "isolated" PR019598 polypeptide-encoding nucleic Biology, Wiley Interscience Publishers, (1995). 

acid or other polypeptide-encoding nucleic acid is a nucleic "Stringent conditions" or "high stringency conditions", as 

acid molecule that is identified and separated from at least defined herein, may be identified by those that: (1) employ 

one contaminant nucleic acid molecule with which it is 20 low ionic strength and high temperature for washing, for 

ordinarily associated in the natural source of the example 0.015 M sodium chloride/0.0015 M sodium citrate/ 

polypeptide-encoding nucleic acid. An isolated polypeptide- 0.1% sodium dodecyl sulfate at 50° C; (2) employ during 

encoding nucleic acid molecule is other than in the form or hybridization a denaturing agent, such as formamide, for 

setting in which it is found in nature. Isolated polypeptide- example, 50% (v/v) formamide with 0.1% bovine serum 

encoding nucleic acid molecules therefore are distinguished 25 albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM 

from the specific polypeptide-encoding nucleic acid mol- sodium phosphate buffer at pH 6.5 with 750 mM sodium 

ecule as it exists in natural cells. However, an isolated chloride, 75 mM sodium citrate at 42° C; or (3) employ 50% 

polypeptide-encoding nucleic acid molecule includes formamide, 5xSSC (0.75 M NaCl, 0.075 M sodium citrate), 

polypeptide-encoding nucleic acid molecules contained in 50 mM sodium phosphate (pH 6.8), 0.1% sodium 

cells that ordinarily express the polypeptide where, for 30 pyrophosphate, SxDenhardt's solution, sonicated salmon 

example, the nucleic acid molecule is in a chromosomal sperm DNA (50 //g/ml), 0.1% SDS, and 10% dextran sulfate 

location different from that of natural cells. at 42° C, with washes at 42° C. in 0.2xSSC (sodium 

The term "control sequences" refers to DNA sequences chloride/sodium citrate) and 50% formamide at 55° C, 

necessary for the expression of an operably linked coding followed by a high-stringency wash consisting of O.lxSSC 

sequence in a particular host organism. The control 35 containing EDTA at 55° C 

sequences that are suitable for prokaryotes, for example, "Moderately stringent conditions" may be identified as 

include a promoter, optionally an operator sequence, and a described by Sambrook et al., Molecular Cloning: A Labo- 

ribosome binding site. Eukaryotic cells are known to utilize ratory Manual, New York: Cold Spring Harbor Press, 1989, 

promoters, polyadenylation signals, and enhancers. and include the use of washing solution and hybridization 

Nucleic acid is "operably linked" when it is placed into a 40 conditions (e.g., temperature, ionic strength and % SDS) less 

functional relationship with another nucleic acid sequence. stringent that those described above. An example of mod- 

For example, DNA for a presequence or secretory leader is erately stringent conditions is overnight incubation at 37° C. 

operably linked to DNA for a polypeptide if it is expressed in a solution comprising: 20% formamide, 5xSSC (150 mM 

as a prep rote in that participates in the secretion of the NaQ, 15 mM trisodium citrate), 50 mM sodium phosphate 

polypeptide; a promoter or enhancer is operably linked to a 45 (pH 7.6), 5xDenhardt's solution, 10% dextran sulfate, and 

coding sequence if it affects the transcription of the 20 mg/ml denatured sheared salmon sperm DNA, followed 

sequence; or a ribosome binding site is operably linked to a by washing the filters in lxSSC at about 37-50° C. The 

coding sequence if it is positioned so as to facilitate trans- skilled artisan will recognize how to adjust the temperature, 

lation. Generally, "operably linked" means that the DNA ionic strength, etc. as necessary to accommodate factors 

sequences being linked are contiguous, and, in the case of a 50 such as probe length and the like. 

secretory leader, contiguous and in reading phase. However, The term "epitope tagged" when used herein refers to a 

enhancers do not have to be contiguous. Linking is accom- chimeric polypeptide comprising a PR019598 polypeptide 

plished by ligation at convenient restriction sites. If such fused to a "tag polypeptide". The tag polypeptide has 

sites do not exist, the synthetic oligonucleotide adaptors or enough residues to provide an epitope against which an 

linkers are used in accordance with conventional practice. 55 antibody can be made, yet is short enough such that it does 

The term "antibody" is used in the broadest sense and not interfere with activity of the polypeptide to which it is 

specifically covers, for example, single anti-PR019598 fused. The tag polypeptide preferably also is fairly unique so 

monoclonal antibodies (including agonist, antagonist, and that the antibody does not substantially cross-react with 

neutralizing antibodies), anti-PR0l9598 antibody composi- other epitopes. Suitable tag polypeptides generally have at 

tions with polyepitopic specificity, single chain anti- 60 least six amino acid residues and usually between about 8 

PR019598 antibodies, and fragments of anti-PR019598 and 50 amino acid residues (preferably, between about 10 

antibodies (see below). The term "monoclonal antibody" as and 20 amino acid residues). 

used herein refers to an antibody obtained from a population As used herein, the term "immunoadhesin" designates 

of substantially homogeneous antibodies, i.e., the individual antibody-like molecules which combine the binding speci- 

antibodies comprising the population are identical except for 65 ficity of a heterologous protein (an "adhesin") with the 

possible naturally-occurring mutations that may be present effector functions of immunoglobulin constant domains, 

in minor amounts. Structurally, the immunoadhesins comprise a fusion of an 
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amino acid sequence with the desired binding specificity 
which is other than the antigen recognition and binding site 
of an antibody (i.e., is "heterologous"), and an immunoglo- 
bulin constant domain sequence. The adhesin part of an 
immunoadhesin molecule typically is a contiguous amino 
acid sequence comprising at least the binding site of a 
receptor or a ligand. The immunoglobulin constant domain 
sequence in the immunoadhesin may be obtained from any 
immunoglobulin, such as IgG-1, IgG-2, IgG-3, or IgG-4 
subtypes, IgA (including IgA-1 and IgA-2), IgE, IgD or 
IgM. 

"Active" or "activity" for the purposes herein refers to 
form(s) of a PR019598 polypeptide which retain a biologi- 
cal and/or an immunological activity of native or naturally- 
occurring PR019598, wherein "biological" activity refers to 
a biological function (either inhibitory or stimulatory) 
caused by a native or naturally-occurring PRO 19598 other 
than the ability to induce the production of an antibody 
against an antigenic epitope possessed by a native or 
naturally-occurring PR019598 and an "immunological" 
activity refers to the ability to induce the production of an 
antibody against an antigenic epitope possessed by a native 
or naturally-occurring PRO 19598. 

The term "antagonist" is used in the broadest sense, and 
includes any molecule that partially or fully blocks, inhibits, 
or neutralizes a biological activity of a native PRO 19598 
polypeptide disclosed herein. In a similar manner, the term 
"agonist" is used in the broadest sense and includes any 
molecule that mimics a biological activity of a native 
PRO 195 98 polypeptide disclosed herein. Suitable agonist or 
antagonist molecules specifically include agonist or antago- 
nist antibodies or antibody fragments, fragments or amino 
acid sequence variants of native PR019598 polypeptides, 
peptides, antisense oligonucleotides, small organic 
molecules, etc. Methods for identifying agonists or antago- 
nists of a PR019598 polypeptide may comprise contacting 
a PRO 19598 polypeptide with a candidate agonist or antago- 
nist molecule and measuring a detectable change in one or 
more biological activities normally associated with the 
PR019598 polypeptide. 

"Treatment" refers to both therapeutic treatment and 
prophylactic or preventative measures, wherein the object is 
to prevent or slow down (lessen) the targeted pathologic 
condition or disorder. Those in need of treatment include 
those already with the disorder as well as those prone to have 
the disorder or those in whom the disorder is to be prevented. 

"Chronic" administration refers to administration of the 
agent(s) in a continuous mode as opposed to an acute mode, 
so as to maintain the initial therapeutic effect (activity) for 
an extended period of time. "Intermittent" administration is 
treatment that is not consecutively done without 
interruption, but rather is cyclic in nature. 

"Mammal" for purposes of treatment refers to any animal 
classified as a mammal, including humans, domestic and 
farm animals, and zoo, sports, or pet animals, such as dogs, 
cats, cattle, horses, sheep, pigs, goats, rabbits, etc. 
Preferably, the mammal is human. 

Administration "in combination with" one or more further 
therapeutic agents includes simultaneous (concurrent) and 
consecutive administration in any order. 

"Carriers" as used herein include pharmaceutical^ 
acceptable carriers, excipients, or stabilizers which are non- 
toxic to the cell or mammal being exposed thereto at the 
dosages and concentrations employed. Often the physiologi- 
cally acceptable carrier is an aqueous pH buffered solution. 
Examples of physiologically acceptable carriers include 
buffers such as phosphate, citrate, and other organic acids; 



antioxidants including ascorbic acid; low molecular weight 
(less than about 10 residues) polypeptide; proteins, such as 
serum albumin, gelatin, or immunoglobulins; hydrophilic 
polymers such as polyvinylpyrrolidone; amino acids such as 

5 glycine, glutamine, asparagine, arginine or lysine; 
monosaccharides, disaccharides, and other carbohydrates 
including glucose, mannose, or dextrins; chelating agents 
such as EDTA; sugar alcohols such as mannitol or sorbitol; 
salt-forming counterions such as sodium; and/or nonionic 

10 surfactants such as TWEEN™, polyethylene glycol (PEG), 
and PLURONICS™. 

"Antibody fragments" comprise a portion of an intact 
antibody, preferably the antigen binding or variable region 
of the intact antibody. Examples of antibody fragments 

15 include Fab, Fab', F(ab') 2 , and Fv fragments; diabodies; 
linear antibodies (Zapata et al., Protein Eng. 8(10): 
1057-1062 [1995]); single-chain antibody molecules; and 
multispecific antibodies formed from antibody fragments. 
Papain digestion of antibodies produces two identical 

20 antigen -binding fragments, called "Fab" fragments, each 
with a single antigen-binding site, and a residual "Fc" 
fragment, a designation reflecting the ability to crystallize 
readily. Pepsin treatment yields an F(ab') 2 fragment that has 
two antigen-combining sites and is still capable of cross- 

25 linking antigen. 

"Fv" is the minimum antibody fragment which contains a 
complete antigen-recognition and -binding site. This region 
consists of a dimer of one heavy- and one light-chain 
variable domain in tight, non-covalent association. It is in 

30 this configuration that the three CDRs of each variable 
domain interact to define an antigen-binding site on the 
surface of the V^V^ dimer. Collectively, the six CDRs 
confer antigen-binding specificity to the antibody. However, 
even a single variable domain (or half of an Fv comprising 

35 only three CDRs specific for an antigen) has the ability to 
recognize and bind antigen, although at a lower affinity than 
the entire binding site. 

The Fab fragment also contains the constant domain of 
the light chain and the first constant domain (CHI) of the 

40 heavy chain. Fab fragments differ from Fab' fragments by 
the addition of a few residues at the carboxy terminus of the 
heavy chain CHI domain including one or more cysteines 
from the antibody hinge region. Fab'-SH is the designation 
herein for Fab' in which the cysteine residue(s) of the 

45 constant domains bear a free thiol group. F(ab') 2 antibody 
fragments originally were produced as pairs of Fab' frag- 
ments which have hinge cysteines between them. Other 
chemical couplings of antibody fragments are also known. 
The "light chains" of antibodies (immunoglobulins) from 

50 any vertebrate species can be assigned to one of two clearly 
distinct types, called kappa and lambda, based on the amino 
acid sequences of their constant domains. 

Depending on the amino acid sequence of the constant 
domain of their heavy chains, immunoglobulins can be 

55 assigned to different classes. There are five major classes of 
immunoglobulins: IgA, IgD, IgE, IgG, and IgM, and several 
of these may be further divided into subclasses (isotypes), 
e.g., IgGl, IgG2, IgG3, IgG4, IgA, and IgA2. 

"Single-chain Fv" or "sFv" antibody fragments comprise 

60 the and domains of antibody, wherein these domains 
are present in a single polypeptide chain. Preferably, the Fv 
polypeptide further comprises a polypeptide linker between 
the V H and V L domains which enables the sFv to form the 
desired structure for antigen binding. For a review of sFv, 

65 see Pluckthun in The Pharmacology of Monoclonal 
Antibodies, vol. 113, Rosenburg and Moore eds., Springer- 
Verlag, New York, pp. 269-315 (1994). 
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The term "diabodies" refers to small antibody fragments epitope on a particular polypeptide without substantially 
with two antigen-binding sites, which fragments comprise a binding to any other polypeptide or polypeptide epitope, 
heavy-chain variable domain (V„) connected to a light- The word "label" when used herein refers to a detectable 
chain variable domain (V £ ) in the same polypeptide chain compound or composition which is conjugated directly or 
(V /r -VJ. By using a linker that is too short to allow pairing 5 indirectly to the antibody so as to generate a "labeled" 
between the two domains on the same chain, the domains are antibody. The label may be detectable by itself (e.g. radio- 
forced to pair with the complementary domains of another isotope labels or fluorescent labels) or, in the case of an 
chain and create two antigen-binding sites. Diabodies are enzymatic label, may catalyze chemical alteration of a 
described more fully in, for example, EP 404,097; WO substrate compound or composition which is detectable. 
93/11161; and Hollinger et al., Proc. NatL Acad ScL USA, 10 f . B ? 15 me * m a ^aqueous matrix to which 
00-6444 aaaq /i gga\ the antJ b°dy °* tne present invention can adhere. Examples 
* ... . .?* , . , , . . of solid phases encompassed herein include those formed 
An isolated antibody is one which has been identified ^ Qf of ^ ( mUolM ^ g^), 

and separated and/or recovered from a component of its polysaccharides (e.g., agarose), polyacrylamides, 

natural environment. Contaminant components of its natural polystyrene, polyvinyl alcohol and silicones. In certain 

environment are materials which would interfere with diag- 15 embodiments, depending on the context, the solid phase can 

nostic or therapeutic uses for the antibody, and may include comprise the well of an assay plate; in others it is a 

enzymes, hormones, and other proteinaceous or nonpro- purification column (e.g., an affinity chromatography 

teinaceous solutes. In preferred embodiments, the antibody column). This term also includes a discontinuous solid phase 

will be purified (1) to greater than 95% by weight of of discrete particles, such as those described in U.S. Pat. No. 

antibody as determined by the Lowry method, and most 20 4,275,149. 

preferably more than 99% by weight, (2) to a degree A "liposome" is a small vesicle composed of various 

sufficient to obtain at least 15 residues of N-terminal or types of lipids, phospholipids and/or surfactant which is 

internal amino acid sequence by use of a spinning cup useful for delivery of a drug (such as a PR019598 polypep- 

sequenator, or (3) to homogeneity by SDS-PAGE under tide or antibody thereto) to a mammal. The components of 

reducing or nonreducing conditions using Coomassie blue 25 the liposome are commonly arranged in a bilayer formation, 

or, preferably, silver stain. Isolated antibody includes the similar to the lipid arrangement of biological membranes, 

antibody in situ within recombinant cells since at least one A "small molecule" is defined herein to have a molecular 

component of the antibody's natural environment will not be weight below about 500 Daltons. 

present. Ordinarily, however, isolated antibody will be pre- An "effective amount" of a polypeptide disclosed herein 

pared by at least one purification step. 30 or an agonist or antagonist thereof is an amount sufficient to 

An antibody that "specifically binds to" or is "specific carry out a specifically stated purpose. An "effective 

for" a particular polypeptide or an epitope on a particular amount" may be determined empirically and in a routine 

polypeptide is one that binds to that particular polypeptide or manner, in relation to the stated purpose. 



TABLE 1 



r 

* 

* C-C increased from 12 to 15 

* Z is average of EQ 

* B is average of ND 

* match with stop is _JA; stop-stop = 0; J (joker) match = 0 
7 

#define _M -8 /* value of a match with a stop */ 

int _day(26][26] - { 

/• ABCDEFGHIJKLMNOPQRSTUVWXYZ7 



/*A*/ 
/'BV 

rev 
r d 7 

/* E 7 
r F7 
/•G7 
/•H7 

n 7 

/•J 7 
/•K7 
/* L 7 
r M 7 
/* N 7 

rov 
r p 7 

/*Q7 

r r 7 

/•S7 
/•TV 

/• u v 

/• V7 

/•xv 

/• Y7 
/*Z7 

}; 



2, 0, -2, 0, 0, -4, 1, -1, -1, 0, -1, -2, -1, 0, _M, 1, 0, -2, 1, 1, 0, 0, -6, 0, -3, 0}, 
0, 3, -4, 3, 2, -5, 0, 1, -2, 0, 0, -3, -2, 2, _M, -1, 1, 0, 0, 0, 0, -2, -5, 0, -3, 1}, 

2, -4, 15, -5, -5, -4, -3, -3, -2, 0, -5, -6, -5, -4, _M, -3, -5, -4, 0, -2, 0, -2, -8, 0, 0, -5}, 
0, 3, -5, 4, 3, -6, 3, 1, -2, 0, 0, -4, -3, 2, __M, -1, 2, -1, 0, 0, 0, -2, -7, 0, -4, 2}, 
0, 2, -5, 3, 4, -5, 0, 3, -2, 0, 0, -3, -2, 1, _M, -1, 2, -1, 0, 0, 0, -2, -7, 0, -4, 3}, 

4, -5, -4, -6, -5, 9, -5, -2, 1, 0, -5, 2, 0, -4, _M, -5, -5, -4, -3, -3, 0, -3, 0, 0, 7, -5}, 

3, 0, -3, 1, 0, -5, 5, -2, -3, 0, -2, -4, -3, 0, _M, -1, -1, -3, 1, 0, 0, -1, -7, 0, -5, 0}, 
-1, 1, -3, 1, 1, -2, -2, 6, -2, 0, 0, -2, -2, 2, _M, 0, 3, 2, -1, -3, 0, -2, -3, 0, 0, 2}, 

-1, -2, -2, -2, -2, 1, -3, -2, 5, 0, -2, 2, 2, -2, _M, -2, -2, -2, -1, 0, 0, 4, -5, 0, -1, -2}, 
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, _M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 
-1, 0, -5, 0, 0, -5, -2, 0, -2, 0, 5, -3, 0, 3, _M, -1, 3, 3, 0, 0, 0, -2, -3, 0, -4, 0}, 
-2, -3, -6, -4, -3, 2, -4, -2, 2, 0, -3, 6, 4, -3, _M, -3, -2, -3, -3 , -1, 0, 2, -2, 0, -1, -2} 
1, -2, -5, -3, -2, 0, -3, -2, 2, 0, 0, 4, 6, -2, _M, -2, -1, 0, -2, -1, 0, 2, -4, 0, -2, -1}, 

0, 2, -4, 2, 1, -4, 0, 2, -2, 0, 1, -3, -2, 2, _M, -1, 1, 0, 1, 0, 0, -2, -4, 0, -2, 1}, 

_M_M_M>_M_M_M__M, M_M_M__M_M_M__M, 0,_M,_M,_M,_M,_M,_M,_M,_>1,_M,_M,_M,} 3 

1, -1, -3, -1, -1, -5, -1, 0, -2, 0, -3, -3, -2, -1_M, 6, 0, 0, 3, 0, 0, -1, -6, 0, -5, 0}, 

0, 1, -5, 2, 2, -5, -1, 3, -2, 0, 3, -2, -3, 3, _M, 0, 4, 1, -3, -1, 0, -2, -5, 0, -4, 3}, 
-2, 0, -4, -3, -1, -4, -3, 2, -2, 0, 3, -3, 0, 0, _M, 0, 1, 6, 0, -3, 0, -2, 2, 0, -4, 0}, 

1, 0, 0, 0, 0, -3, 1, -3, -3, 0, 0, -3, -2, 1, _M, 1, -1, 0, 2, 1, 0, -1, -2, 0, -3, 0}, 
3, 0, -2, 0, 0, -3, 0, -1, 0, 0, 0, -3, -1, 0, _M, 0, -3, -3, 3, 3, 0, 0, -5, 0, -3, 0}, 
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 

0, -2, -2, -2, -2, -3, -3, -2, 4, 0, -2, 2, 2, -2_M, -3, -2, -2, -3, 0, 0, 4, -6, 0, -2, -2}, 
-6, -5, -8, -7, -7, 0, -7, -3, -5, 0, -3, -2, -4, -4_M, -6, -5, 2, -2, -5, 0, -6, 37, 0, 0, -6}, 
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, _M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 
■3, -3, 0, -4, -4, 7, -5, 0, -3, 0, -4, -3, -2, -2, _M, -5, -4, -4, -3, -3, 0, -2, 0, 0, 30, -4}, 
{0, 3, -5, 2, 3, -5, 0, 2, -2, 0, 0, -2, -3, 1_M, 0, 3, 0, 0, 0, 0, -2, -6, 0, -4, 4}, 
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TABLE 1 -continued 



tfinclude <stdio.h> 
#includc <ctypc.b> 
#dcfine MAXJMP 
#define MAXGAP 
#define JMPS 
#definc MX 
#define DMAT 
#define DMIS 
#define DINSO 
#define DINS1 
#define PINSO 
#deflne PINS1 
struct jmp { 
short 

unsigned short 



16 /' max jumps in a diag •/ 

24 /* don't continue to penalize gaps largCT than this */ 

1024 /• max jmps in an path */ 

4 /* save if there's at least MX-1 bases since last jmp 7 

3 /* value of matching bases */ 

0 /* penalty for mismatched bases "/ 
8 /* penalty for a gap */ 

1 /* penalty per base */ 
8 /* penalty for a gap •/ 

4 /* penalty per residue */ 



n[MAXJMPj 
xtMAXJMPj 



}; 

struct diag { 
int 
long 
short 

struct jmp 

}; 

struct path { 
int 
short 
int 

}; 



score; 
offset; 
ijmp; 

jp; 



spc; 

n[JMPSj 
xCJMPSj 



char *ofile; 

char *namex[2j; 

char *prog; 

char *seqx[2]; 

int dm ax; 

int dmaxO; 

int dna; 

int endgaps; 

int gapx, gapy; 

int lenO, lenl; 

int ngapx, ngapy; 

int smax; 

int *xbm; 

long offset; 

struct diag *dx; 

struct path pp[2]; 
char 
char 

/" Needleman-Wunsch alignment program 



*callocO, *malloc0, *index0, *strcpy0; 
•getseqQ, *g_calloc0; 



/* size of jmp (neg for dely) */ 
/* base no. of jmp in seq x "/ 
l m limits seq to 2 16-17 



/* score at last jmp 7 
/* offset of prev block •/ 
/* current jmp index 7 
/* list of jmps */ 



/* number of leading spaces */ 

/* size of jmp (gap) 7 

/* loc of jmp (last elem before gap) 7 

/* output file name */ 
/* seq names: getseqsO 7 
/* prog name for err msgs 7 

/* seqs: getseqsO "/ 
/* best diag: nwQ "/ 
/* final diag */ 
/* set if dna: main() */ 
/* set if penalizing end gaps */ 
/* total gaps in seqs 7 
/* seq lens */ 
/* total size of gaps •/ 
/* max score: nwQ 7 
/* bitmap for matching */ 
/* current offset in jmp file */ 
/* holds diagonals */ 
/* holds path for seqs */ 



* usage: progs filel file2 

* where filel and file2 are two dna or two protein sequences. 

* The sequences can be in upper- or lower-case an may contain ambiguity 

* Any lines beginning with *;', 4 >* or *<* are ignored 

* Max file length is 65535 (limited by unsigned short x in the jmp struct) 

* A sequence with l A or more of its elements ACGTXJ is assumed to be DNA 
" Output is in the file "align. out" 

■ 

■ The program may create a tmp file in Amp to hold info about traceback. 

* Original version developed under BSD 4.3 on a vax 8650 
7 

#include "nw.h" 
#include "day.h" 
static _dbval[26] = { 

1,14,2,13,0,0,4,11,0,0,12,0^,15,0,0,0^,6,8,8,7^,0,10,0 

}; 

static _pbval[26] = { 

1, 2|(1< <('D'-*A'))|(1< <( 4 N'- 4 A')), 4, 8, 16, 32, 64, 
128, 256, OxFFFFFFF, 1< <10, 1< <11, 1< <12, 1< <13, 1< <14, 
1< <15, 1< <16, 1< <17, 1< <18, 1< <19, 1< <20, 1< <21, 1< <22, 
1< <23, 1< <24, 1< <25|(1< <CE'-'A'))|(1< <CQ'-'A')) 

}; 

main(ac, av) 

int ac; 
char *av[]; 

{ 

prog - av(03; 
if(ac !- 3) { 

fprintf(stderT, "usage: %s filel file2\n", prog); 
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TABLE 1 -continued 



fprintf(stderr, "where filel and file2 are two dna or two protein sequences.W); 
fprint^stderr, "The sequences can be in upper- or lower-case\n"); 
fj)rintf(stdeiT, "Any lines beginning with *;* or *<' are ignored\n"); 
fjmntf(stderr, "Output is in the file \"align.out\"\n"); 
exit(l); 

} 

namex{0] - av(l]; 

namex[l] « av(2]; 

seqx{0] » getseq(namex[0], &len0); 

seqx[l] = gets eq (nam ex[l], &lenl); 

xbm ■ (dna)? _dbval : _pbval; 

endgaps - 0; /• 1 to penalize endgaps 7 

ofUe » "align.out"; /* output file */ 

nwO; /* fill in the matrix, get the possible jmps 7 

readjmpsO", /* get the actual jmps 7 

printO; /* print stats, alignment */ 

cleanup(0); /* unlink any tmp files 7 

} 

/* do the alignment, return best score: main() 

* dna: values in Fitch and Smith, PNAS, 80, 1382-1386, 1983 

* pro: PAM 250 values 

* When scores are equal, we prefer mismatches to any gap, prefer 

* a new gap to extending an ongoing gap, and prefer a gap in seqx 

* to a gap in seq y. 
•/ 

nw() 
{ 

char *px, *py; /* seqs and ptrs */ 

int 'ndely, *dely; /* keep track of dely */ 

int ndelx, delx; /* keep track of delx */ 

int *tmp; /* for swapping rowO, rowl */ 

int mis; /* score for each type 7 

int insO, insl; /* insertion penalties */ 

register id; /* diagonal index ■/ 

register ij; /* jmp index */ 

register *col0, *coll; /'score for curr, last row */ 

register xx, yy; /* index into seqs •/ 

dx = (struct diag *)g_calloc("to get diags", lenO+lenl+l, sizeof(struct diag)); 

ndely = (int *)g_calloc("to get ndely", lenl+1, sizeof(int)); 

dely - (int *)g_calloc("to get dely", lenl+1, sizeof(int)); 

colO - (int *)g_calloc("to get colO", lenl+1, sizeof(int)); 

coll = (int *)g_calloc("to get coll", lenl+1, sizeof(int)); 

insO - (dna)? DINS0 : PINS0; 

insl - (dna)? DINS1 : P1NS1; 

smax ° -10000; 

if (endgaps) { 

for (col0[0] = dely[0] - -insO, yy = 1; yy <= lenl; yy++) { 
col0[yy] = dely[yy] - colO[yy-l] - insl; 
ndelyfyy] - yy; 

} 

co!0[0] - 0; / m Waterman Bull Math Biol 84 •/ 

} 

else 

for (yy= 1; yy <= lenl; yy++) 
delyfyy] - -insO; 
/" fill in match matrix 
7 

for (px - seqxfOj xx « 1; xx <= lenO; px++, xx++) { 
/• initialize first entry in col 
7 

if (endgaps) { 
if (xx — 1) 

coll[0] = debt - -(insO+insl); 

else 

coll[0] = delx = colO[0]-insl; 
ndelx = xx; 

} 

else { 

coll[0] - 0; 
delx - -insO; 
ndelx - 0; 

} 

for (py = seqx[l], yy = 1; yy <= lenl; py++, yy++) { 
mis - col0[yy-l]; 
if (dna) 

mis + - (xbin{*px- 4 A']&xbm[*py-'A'])? DMAT : DMIS; 

else 

mis += _ day[ - px- 4 A'I*py-*A't 
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/* update penalty for del in x seq; 

* favor new del over ongong del 

* ignore MAXGAP if weighting endgaps 
•/ 

if (endgaps || ndely[yy] < MAXGAP) { 
if (col0[yy] - insO >= dely[yyD { 

dely(yy] = col0[yy] - (insO+insl); 

ndelylyy] - 1; 
}else{ 

dely(yy] -=> insl; 

ndely[yy}++; 

} 

}else{ 

if (col0[yy3 - (insO+insl) >- dely[yyj) { 

dely[yy] - col0[yy] - (insO+insl); 

ndely[yy] = 1 ; 
} else 

ndely[yy]++; 

} 

/* update penalty for del in y seq; 

* favor new del over ongong del 
•/ 

if (endgaps || ndelx < MAXGAP) { 
if(coll[yy-l] - insO >- delx) { 

delx = coll[yy-l] - (insO+insl); 

ndelx =1; 
}else{ 

delx — insl; 

ndelx++; 

} 

}else{ 

if (coll[yy-l] - (insO+insl) >= delx) { 

delx - coll[yy-l] - (insO+insl); 

ndelx - 1; 
} else 

ndelx++; 

} 

/• pick the maximum score; we're favoring 

* mis over any del and delx over dely 
7 

...nw 

id ■» xx - yy + lenl - 1; 

if (mis >= delx && mis >= delyfyy]) 

coll[yy] = mis; 
else if (delx >= delyfyy]) { 

coll[yy] = delx; 

ij = dx[id].ijmp; 

if (dx[id].jp.n[0] && (!dna || (ndelx >= MAXJMP 
&& xx > dx(id].jp.x[ij>MX) || mis > dx[id].score+DINSO)) { 
dx(id].ijmp++; 
if (++ij >= MAXJMP) { 
writejmps(id); 
ij = dx[id].ijmp « 0; 
dxjidjoflfset - offset; 

offset += sizeof(struct jmp) + sizeof(offset); 

} 

} 

dx[id].jp.n[ij] = ndelx; 
dx[id].jp.x[ij] - xx; 
dx[id].score = delx; 

} 

else{ 

collfyy] delyfyy]; 
ij - dxf 'idjijmp; 
if (dx(id].jp.n[0] && (!dna || (ndelyfyy] >= MAXJMP 

&& xx > dx[id}jpjc[ij]+MX) || mis > dx(id].score+DINSO)) { 
dx[id].ijmp++; 
if (++ij >- MAXJMP) { 
writejmps(id); 
ij » dx[id].ijmp - 0; 
dx[id>offset - offset; 

offset +- sizeof(struct jmp) + sizeof (offset); 

} 

} 

dx[id].jp.n[ij] — ndely[yy]; 
dx[id].jp.x(ij] - xx; 
dx[id].score » delyfyyj 

} 

if (xx — lenO && yy < lenl) { 
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} 

r 



r last col 
•/ 

if (endgaps) 

coll[yy] — insO+insl *(lenl-yy); 
if(coll[yy] > smax) { 

smax = colljyyi 

dmax ■= id; 

} 

} 

} 

if (endgaps && xx < lenO) 

ooll[yy-l] -= insO+insl *(lenO-xx); 
if (coll[yy-l] > smax) { 

smax = coll[yy-l]; 

dmax - id; 

} 

tmp - colO; colO - coll; coll = Imp; 

} 

(void) frec((char *)ndely); 
(void) free((char *)dely); 
(void) free((char *)col0); 
(void) free((char *)coll); 



* printO - only routine visible outside this module 

* static: 

* getmatO -- trace back best path, count matches: printO 

* pr_align0 " print alignment of described in array p(]: printO 

* dumpblockO ~ dump a block of lines with numbers, stars: pr__align() 

* numsO - put out a number line: dumpblockO 

* putlineO - put out a line (name, [num], seq, [num]): dumpblockO 

* starsO - -put a line of stars: dumpblockO 

* stripnameO -- strip any path and prefix from a seqname 
V 

#include "nw.h" 

#define SPC 3 

#define P __LINE 256 /• maximum output line •/ 

#deftne P_SPC 3 /* space between name or num and seq */ 

extern _day[26][26]; 

int olen; /* set output line length */ 

FILE *fx; /* output file •/ 

printO print 

{ 

int lx, ly, firstgap, lastgap; /* overlap •/ 
if ((fx « fopen(ofile, M w")) — 0) { 

fprintf(stderr, "%s: can't write %s\n", prog, ofile); 

cleanup(l); 

} 

fprintf(fx, "<first sequence: %s (length = %d)\n", namex[0l lenO); 
fprintf(fx, "csecond sequence: %s (length = %d)\n", namexfl], lenl); 
olen = 60; 
lx = lenO; 
ly - lenl; 

firstgap » lastgap - 0; 

if (dmax < lenl - 1) { /* leading gap in x */ 

pp[0].spc = firstgap - lenl - dmax - 1; 
ly -= pp[0].spc; 

} 

else if (dmax > lenl - 1) { /* leading gap in y •/ 
pp[l].spc " firstgap - dmax - (lenl - 1); 
lx -= pp[l].spc; 

} 

if (dmaxO < lenO - 1) { /* trailing gap in x */ 

lastgap = lenO - dmaxO -1; 
lx -«=■ lastgap; 

} 

else if (dmaxO > lenO - 1) { /* trailing gap in y */ 
lastgap « dmaxO - (lenO - 1); 
ly — lastgap; 

} 

getmat(lx, ly, firstgap, lastgap); 
pr_align0; 

} 
/• 

* trace back the best path, count matches 
7 

static 
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getmat(lx, ly, firstgap, lastgap) getmat 
Lot be, ly; /* "core" (minus endgaps) */ 

inl firstgap, lastgap; /* leading trailing overlap */ 

{ 

int am, iO, il, sizO, sizl; 

char outx(32j 
double pet; 
register nO, nl; 

register char 'pO, *pl; 

/* get total matches, score 
7 

iO - il - sizO = sizl = 0; 
pO - seqx[03 + pp[l].spq 
pi - seqxfl] + ppfOjspc; 
nO -> pp[l].spc + 1; 
nl = pp[0].spc + 1; 
am = 0; 

whUe ( 'pO && *pl ) { 
if (sizO) { 
pl++; 
nl++; 
sizO — ; 

else if (sizl) { 
p0++; 
n0++; 
sizl — ; 

} 

else { 

if (xbm[*r^A*]&xbm[*pl- A']) 

nm++; 
if (n0++ — pp[0].x[iO]) 

sizO = ppfO].n[i&*-*-]; 
if (nl++ = pp[l]jc(ilD 

sizl = pp[l].n[U++]; 
pO++; 
pl++; 

} 

} 

/* pet homology: 

* if penalizing endgaps, base is the shorter seq 

* else, knock off overhangs and take shorter core 
7 

if (endgaps) 

lx = (lenO < lenl)? lenO : lenl; 

else 

lx = (lx < ly)? lx : ly; 
pet - 100.*(double)nm/(double)lx; 
fyrintf(fx, "\n"); 

fj)rintf(fx, "<%d match%s in an overlap of %d: %.2f percent similarity\n", 
nm, (nm «== 1)? : "es", lx, pet); 

fprintf(fx, "<gaps in first sequence: %d", gapx); ...getmat 
if (gapx) { 

(void) sprintf(outx, "(%d %s%s)", 

ngapx, (dna)? "base": "residue", (ngapx ~ 1)? ""iV); 

fprintf(6c, "% s", outx); 
fprintf(fx, gaps in second sequence: %d", gapy); 

if (gapy) { 

(void) sprintfi(outx, "(%d %s%s)", 

ngapy, (dna)? "base":"residue", (ngapy == 1)? **":"s"); 
fprintf(fx, "%s", outx); 

} 

if (dna) 

fprintf(fx, 

*'\n<score: %d (match » %d, mismatch = %d, gap penalty = %d + %d per base)\n", 
smax, DMAT, DMIS, DINSO, DINS1); 

else 

fprintf(6c, 

*An<score: %d (Dayhoff PAM 250 matrix, gap penalty = %d + %d per residue)\n", 
smax, PINS0, PINS1); 
if (endgaps) 
fprintf(fx, 

"<endgaps penalized, left endgap: %d %s%s, right endgap: %d %s%s\n", 
firstgap, (dna)? "base" : "residue", (firstgap — 1)? "" : "s", 
lastgap, (dna)? "base" : "residue", (lastgap — 1)? "" : "s"); 

else 

fprintf(fx, "<endgaps not penalized\n"); 

} 

static nm; /* matches in core — for checking 7 
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lmax; 
nc[2]; 

out[2lP_UNE]; 
starfP-LINEj 



static 
static 
static 
static 
static 
static char 
static char 
static char 
static char 

r 

• print alignment of described in struct path ppf] 
V 

static 

pr_alignQ 
{ 

int nn; /* char count 7 

int more; 
register i; 

for (i = 0, lmax = 0; i < 2++) { 
nn *» stripname(namex[i]); 
if (nn > lmax) 

lmax " nn; 

ncti]-l; 

oW-l; 
siz(i]~ij[i] = 0; 
psli] - seqx[ifc 
po[i] - outflj 

} 

for (nn = nm = 0, more - 1; more;) { 
for (i = more = 0; i < 2; i++) { 

r 

* do we have more of this sequence? 
•/ 

ifC* P s[iD 

continue; 
more-H-; 
if (pp[i].spc) { 

*po[i>+ - ' 

pp[i}spc— ; 

} 

else if (sizf i]) { 
*po[i]++ «= 

si*]--; 

} 

else { 



r lengths of stripped file names */ 
/* jmp index for a path */ 
/* number at start of current line */ 
/* current elem number -- for gapping */ 

/• ptr to current element */ 
/• ptr to next output char slot 7 
/* output line 7 
/• set by starsQ 7 



pr__align 



..pr_align 



/* leading space */ 



I* in a gap 7 



/•we're putting a seq element 



} 



} 



if (islower(*ps[iD) 

*ps[i] - toupper(*ps[i]); 
po[i>+; 
ps[i}*+; 
/* 

* are we at next gap for this seq? 
7 

if(ni[i]~pp[i].x(ij[i]]){ 
/• 

* we need to merge all gaps 

* at this location 
7 

MS\ — PP[i]n[ij[i)++]; 
while (ni[i] = pp[i].x(ij[i]D 
siz[i] 4- pp[i].n[ij[i>+]; 

} 

ni[i)++; 



if (++nn «-» olen || !more && nn) { 
dumpblockQ; 
for (i - 0; i < 2; i++) 

po[i] = out[i]; 
nn = 0; 

} 

} 

} 

r 

* dump a block of lines, including numbers, stars: pr_align() 
7 

static 

dumpblockQ 



dump block 
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..dumpblock 



register i; 

for(i » 0; i < 2; i++) 
•po[iK - '\0'; 

(void) putc('\n', fx); 
for (i « 0; i < 2; i++) { 

if Cout[i] && (*out[i] != * * || *(po[i]) !- ' *)) { 
if(i-0) 

nums(i); 
if (i — 0 && *out[l]) 

starsQ; 
putline (i); 

if (i — 0 && *out[l]) 

fprintf(fx, star); 
if(i-l) 

nums(i); 

} 

} 

} 

* put out a number line: dumpblockO 
v 

static 

nums(ix) nums 
int ix; /* index in out[] holding seq line */ 

{ 

char nline[P_LINE]; 
register i, j; 

register char *pn, *px, *py; 
for(pn = nline, i = 0; i < lmax+P_SPC; i++, pn++) 
*pn = ' '; 

for (i = nc[ix], py = out[ix]; *py; py++, pn++) { 

if (*py — - Ml *py- '-•); 

*pn - 4 '; 
else{ 

if (i%10 = 0 || (i — 1 && nc(ix] !- 1)) { 
j - (i < 0)? -i : i; 
for (px » pn; j; j/= 10, px — ) 

*px = j%10 + *0*; 
if (i< 0) 

*px = 

} 

else 

•pn - * '; 

} 

} 

*pn = '\0'; 
nc[ix] - i; 

for (pn •» nline; *pn; pn++) 

(void) putc(*pn, fx); 
(void) putc(*\n', fx); 

} 

r 

* put out a line (name, [num], seq. [num]): dumpblockO 

•/ < - - 

static 

putline(ix) putline 
int ix; 

{ 



...putline 



int i; 
register char *px; 

for (px =» namex[ix], i = 0; *px && *px !=':'; px++, i++) 

(void) putc(*px, fx); 
for (;i < lmax+P_SPC; i++) 

(void) putcC fx); 
I* these count from 1: 

* ni[] is current element (from 1) 

* nc(] is number at start of current line 
7 

for (px " out[ixt *px; px++) 

(void) putc(*px&0x7F, fx); 
(void) putc('\n\ fx); 

} 

r 

* put a line of stars (seqs always in out[0], out[l]): dumpblockO 
V 

static 
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staisO 

{ 

int i; 

register char *p0, *pl, cx, "px; 
if (!*out(0] || (*outl0] — * * && '(pOtO]) = ") II 
!*out[l] 0 (*out(l] ~ • • && *(pa[l]) — * ')) 
return; 

px = star; 

for (i » lmax+P_SPQ i; i— ) 
*px++ - 4 '; 

for (pO - ourfO], pi - outfit *p0 && *pl; p(>++, pl++) { 
if (isalpha('pO) && isalpha(*pl)) { 

if (xbn^*pO-'A']^bm[*pl-'A']) { 
cx = ""; 
nm++; 

} 

else if (!dna && _dayf*pO- 'A'ppl-'A'] > 0) 
cx-V; 

else 

cx - 4 *; 

} 

else 

cx - ' ' ; 
*px++ = cx; 

} 

*px++ - '\n'; 
*px - *\0'; 

} 

r 

* strip path or prefix from pn, return len: pr_align() 
V 

static 

strip name(pn) 

char *pn; /* file name (may be path) */ 



strip name 



{ 

register char *px, "py; 

py-0; 

for (px «» pn; *px; px++) 

ifCpx — r) 

py - px + 1; 

if(py) 

(void) strcpy(pn, py); 
return (strlen(pn)); 

} 

r 

* cleanupO — cleanup any tmp file 

* getseqO — read in seq, set dna, len, maxlen 
" g_callocQ - callocQ with error checkin 

* readjmpsO — get the good jmps, from tmp file if necessary 

* writejmpsO — write a filled array of jmps to a tmp file: nw() 
V 



#include "nw.h" 
#include <sys/file.h> 

char •jname = "/tmp/homgXXXXXX"; 

FILE *tj; 

int cleanupO; 

long IseekQ; 



/* tmp file for jmps */ 
/• cleanup tmp file */ 



* remove any tmp file if we blow 
V 

cleanup© 

int i; 

{ 

(void) unlink(jname); 

exit(i); 

} 

r 

* read, return ptr to seq, set dna, len, maxlen 

* skip lines starting with '<', or 4 >' 

* seq in upper or lower case 
7 

char * 

getseq(file, len) 

char *file; /* file name "/ 

int *len; f* seq len •/ 

{ 

char line[1024], "pseq; 

register char *px, "py; 



cleanup 



getseq 



US 6,740,520 B2 
39 40 

TABLE 1 -continued 



int natgc, tlen; 

FILE -fp; 
if ((fp - fopen(fiie, V')) — 0) { 

fprintf(stderr, **%s: can't read %s\n", prog, file); 

exit(3); 

} 

Ucn - natgc » 0; 

while (fgcts(linc, 1024, fp)) { 

if Clinc — V || •line — '<' || 'line *>') 

continue; 
for (px - line; *px != *\n'; px++) 
if (isupper(*px) || islower(*px)) 
tlen++; 

} 

if ((pseq = malloc((unsigned)(tlen+6))) — 0) { 

fprintf(stderr, "%s: mallocO failed to get %d bytes for %s\n", prog, tlen+6, file); 
exit(l); 

} 

pscqtO] = pseq[l j = pseq[2] = pseq[3] - '\0'; 

...getseq 

py - pseq + 4; 
*len = tlen; 
rewind(fp); 

whUe (fgets(line, 1024, fp)) { 

if ("line V II *Hne «= || *line — '>') 

continue; 
for (px - line; *px !-» *\n'; px++) { 
if (isupper(*px)) 

*py++ = *px; 
else if (islower(*px)) 

*py++ o toupper(*px); 
if (index("ATGCU", *(py-l))) 
natgc++; 

} 

} 

•py++ «= '\0'; 
*py - *\0*; 
(void) fclose(fp); 
dna ° natgc > (tlen/3); 
retum(pseq+4); 

} 

char * 

g_calloc(msg, nx, sz) g_calloc 
char *msg; /* program, calling routine */ 

int nx, sz; /* number and size of elements */ 



{ 

char *px, *calloc(); 

if ((px - calloc((unsigned)nx, (unsigned)sz)) = 0) { 
if(*msg){ 

fprintf(stderr, "%s: g_calloc0 faUed %s (n» %d, sz= %d)\n", prog, msg, nx, sz); 
exit(l); 

} 

} 

return(px); 

} 

r 

* get final jmps from dx[] or tmp file, set pp[], reset dmax: ma'mQ 
V 

readjmpsO readjmps 
{ 

int fd = -1; 

int siz, iO, il; 

register i, j, xx; 

if(©{ 

(void) fclose(fj); 

if ((fd - openQname, 0_RDONlY, 0)) < 0) { 

£jprintf(stderr, "%s: can't open() %s\n", prog, jname); 
cleanup(l); 

for (i « iO - il = 0, dmaxO = dmax, xx = lenO; ;i++) { 
while (1) { 

for (j - dx[dmax].ijmp; j >■ 0 && dx[d maxj.jp jcfj] >*» xx; j — ) 

...readjmps 

if (j < 0 && dx(dmax].ofiset && fj) { 
(void) lseek(fd, dx{dmaxJ.ofFset, 0); 
(void) read(fd, (char *)&dx[dmax].jp, sizeof(struct jmp)); 
(void) read(fd, (char *)&dx[dmax].offiset, sizeof(dx(dmax].ofiset)); 
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} 

else 
} 



dxfdmax].ijmp - MAXJMP-1; 



break; 



if (i >= JMPS) { 

fprintf(slderr, "%s: too many gaps in alignment\n" f prog); 
cleanup(l); 

} 

if(j>- 0){ 

siz « dx(dmax].jp.n[j]; 
xx « dx[dmax3.jp.x[j]; 
dm ax += siz; 

if (siz < 0) { /* gap in second seq */ 

pp[l].n[il] = -siz; 
xx += siz; 

/• id = xx - yy + lenl - 1 
V 

pp[l]jcfil] = xx - dmax + lenl - 1; 
gapy++; 
ngapy — siz; 
/• ignore MAXGAP when doing end gaps */ 

siz - (-siz < MAXGAP || endgaps)? -siz : MAXGAP; 
il++; 

} 

else if (siz > 0) { /" gap in first seq */ 
pp[0] .n[i0] " siz; 
pp[0] jEfiO] - xx; 
gapx++; 
ngapx +« siz; 
/* ignore MAXGAP when doing endgaps */ 

siz = (siz < MAXGAP || endgaps)? siz : MAXGAP; 
i0++; 

} 

} 

else 

break; 

} 

l m reverse the order of jmps 
•/ 

for G « 0, iO— ; j < iO; j++, i0~) { 

i - pp[0]4i3; PP(0].n[j] = pp[0].n[i0]; pp[0].n[i0] - i; 
i - pp[0].xUl PP [0]^[j] - PP[0].x(iO]; PP [0Hi0] - i; 

} 

for (j - 0, il— ; j < il; j++, il— ) { 

i - ppM-nlft PPll]-n[j] = pp[l].n[il]; pp[l].ii[il] - t 
i o pptlMil prflMj] - pp[l]jc[il]; pp[l].x(il] - i; 

} 

if (fd >= 0) 

(void) close (fd); 

if(fj){ 

(void) unlinkG'name); 

fj-0; 

offset = 0; 

} 



* write a filled jmp struct offset of the prev one (if any): nwO 
7 

writejmps(ix) 
int ix; 

{ 

char "mktempQ; 

ifO©{ 

if (mktempQname) < 0) { 

fprintf(stderr, "%s: can't mktempO %s\n", prog, jname); 
cleanup(l); 

} 

if ((fj - fopenQiuime, "w")) — 0) { 

fprintf(stderr, "%s: can't write %s\n", prog, jname); 
erittl); 

} 

} 

(void) fwrite((char *)&dx( ixj.jp, sizeof(struct jmp), 1, fj); 
(void) fwrite((char *)&dx[ixJ.offset ) sizeof(dx(ix].offset), 1, fj); 



write jmps 
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PRO 

Comparison 
Protein 



xxxxxxxxxxxxxxx 

XXXXXYYYYYYY 



(Length = 15 amino acids) 
(Length -12 amino acids) 



% amino acid sequence identity=(the number of identically 
matching amino acid residues between the two polypeptide 
sequences as determined by ALIGN-2) divided by (the total 
number of amino acid residues of the PRO polypeptide)=5 
divided by 15=33.3% 



TABLE 3 



PRO 

Comparison 
Protein 



xxxxxxxxxx 

XXXXXYYYYYYZZYZ 



(Length = 10 amino acids) 
(Length = 35 amino acids) 



TABLE 4 



PRO-DNA 

Comparison 

DNA 



NNNNNNNNNNNNNN 
NNNNNNLLLLLLLLLL 



(Length = 14 nucleotides) 
(Length = 16 nucleotides) 



% nucleic acid sequence identity=(the number of identically 
matching nucleotides between the two nucleic acid 
sequences as determined by ALIGN-2) divided by (the total 
number of nucleotides of the PRO-DNA nucleic acid 
sequence)=6 divided by 14=42.9% 



TABLE 5 



PRO-DNA 

Comparison 

DNA 



NNNNNNNNNNNN 
NNNNLLLW 



(Length = 12 nucleotides) 
(Length = 9 nucleotides) 



10 



15 



% amino acid sequence identity=(the number of identically 
matching amino acid residues between the two polypeptide 20 
sequences as determined by ALIGN-2) divided by (the total 
number of amino acid residues of the PRO polypeptide)=5 
divided by 10=50% 
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% nucleic acid sequence identity«=(the number of identically 

matching nucleotides between the two nucleic acid 

sequences as determined by ALIGN-2) divided by (the total 

number of nucleotides of the PRO-DNA nucleic acid 45 

sequence)=4 divided by 12=33.3% 

II. Compositions and Methods of the Invention 

A. Full-length PRO 19598 Polypeptides 

The present invention provides newly identified and iso- 
lated nucleotide sequences encoding polypeptides referred 50 
to in the present application as PR019598 polypeptides. In 
particular, cDNAs encoding PRO 19598 polypeptides have 
been identified and isolated, as disclosed in further detail in 
the Examples below. It is noted that proteins produced in 
separate expression rounds may be given different PRO 55 
numbers but the UNQ number is unique for any given DNA 
and the encoded protein, and will not be changed. However, 
for sake of simplicity, in the present specification the protein 
encoded by the full length native nucleic acid molecules 
disclosed herein as well as all further native homologues and 60 
variants included in the foregoing definition of PRO, will be 
referred to as "PRO/number", regardless of their origin or 
mode of preparation. 

As disclosed in the Examples below, cDNA clones have 
been deposited with the ATCC. The actual nucleotide 65 
sequences of those clones can readily be determined by the 
skilled artisan by sequencing of the deposited clone using 
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routine methods in the art. The predicted amino acid 
sequence can be determined from the nucleotide sequence 
using routine skill. For the PR019598 polypeptides and 
encoding nucleic acids described herein, Applicants have 
identified what is believed to be the reading frame best 
identifiable with the sequence information available at the 
time. 

Using the ALIGN-2 sequence alignment computer pro- 
gram referenced above, it has been found that the full-length 
native sequence PR019598 (shown in FIG. 2 and SEQ ID 
NO:2) has certain amino acid sequence identity with 
AF184971_1. Accordingly, it is presently believed that the 
PR019598 polypeptide disclosed in the present application 
is a newly identified member of the cytokine receptor 
protein family and may possess one or more biological 
and/or immunological activities or properties typical of that 
protein family. 

B. PR019598 Polypeptide Variants 

In addition to the full-length native sequence PRO 195 98 
polypeptides described herein, it is contemplated that 
PR019598 variants can be prepared. PR019598 variants 
can be prepared by introducing appropriate nucleotide 
changes into the PR019598 DNA, and/or by synthesis of the 
desired PR019598 polypeptide. Those skilled in the art will 
appreciate that amino acid changes may alter post- 
translational processes of the PRO 19598, such as changing 
the number or position of glycosylation sites or altering the 
membrane anchoring characteristics. 

Variations in the native full-length sequence PRO 19598 
or in various domains of the PRO 195 98 described herein, 
can be made, for example, using any of the techniques and 
guidelines for conservative and non-conservative mutations 
set forth, for instance, in U.S. Pat. No. 5,364,934. Variations 
may be a substitution, deletion or insertion of one or more 
codons encoding the PR019598 that results in a change in 
the amino acid sequence of the PRO 195 98 as compared with 
the native sequence PRO 19598. Optionally the variation is 
by substitution of at least one amino acid with any other 
amino acid in one or more of the domains of the PRO 19598. 
Guidance in determining which amino acid residue may be 
inserted, substituted or deleted without adversely affecting 
the desired activity may be found by comparing the 
sequence of the PRO 19598 with that of homologous known 
protein molecules and minimizing the number of amino acid 
sequence changes made in regions of high homology. Amino 
acid substitutions can be the result of replacing one amino 
acid with another amino acid having similar structural 
and/or chemical properties, such as the replacement of a 
leucine with a serine, i.e., conservative amino acid replace- 
ments. Insertions or deletions may optionally be in the range 
of about 1 to 5 amino acids. The variation allowed may be 
determined by systematically making insertions, deletions or 
substitutions of amino acids in the sequence and testing the 
resulting variants for activity exhibited by the full-length or 
mature native sequence. 

PR019598 polypeptide fragments are provided herein. 
Such fragments may be truncated at the N-terminus or 
C-terminus, or may lack internal residues, for example, 
when compared with a full length native protein. Certain 
fragments lack amino acid residues that are not essential for 
a desired biological activity of the PRO 19598 polypeptide. 

PRO 19598 fragments may be prepared by any of a 
number of conventional techniques. Desired peptide frag- 
ments may be chemically synthesized. An alternative 
approach involves generating PRO 19598 fragments by 
enzymatic digestion, e.g., by treating the protein with an 
enzyme known to cleave proteins at sites defined by par- 
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ticular amino acid residues, or by digesting the DNA with 
suitable restriction enzymes and isolating the desired frag- 
ment. Yet another suitable technique involves isolating and 
amplifying a DNA fragment encoding a desired polypeptide 
fragment, by polymerase chain reaction (PCR). Oligonucle- 
otides that define the desired termini of the DNA fragment 
are employed at the 5' and 3' primers in the PCR. Preferably, 
PRO 19598 polypeptide fragments share at least one biologi- 
cal and/or immunological activity with the native 
PR019598 polypeptide disclosed herein. 

In particular embodiments, conservative substitutions of 
interest are shown in Table 6 under the heading of preferred 
substitutions. If such substitutions result in a change in 
biological activity, then more substantial changes, denomi- 
nated exemplary substitutions in Table 6, or as further 
described below in reference to amino acid classes, are 
introduced and the products screened. 



TABLE 6 


Original 


Exemplary 


Preferred 


Residue 


Substitutions 


Substitutions 


Ala (A) 


val; leu; lie 


val 


Arg(R) 


lys; gin; asn 


lys 


Asn (N) 


gin; his; lys; arg 


gin 


Asp (D) 


glu 


glu 


Cys(C) 


ser 


ser 


Gin (Q) 


asn 


asn 


Glu (E) 


asp 


asp 


Gly (G) 


pro; ala 


ala 


His (H) 


asn; gin; lys; arg 


arg 


IleG) 


leu; val; met; ala; phe; 


leu 




norleucine 




Leu (L) 


norleucine; ile; val; 


ile 




met; ala; phe 




Lys(K) 


arg; gin; asn 


arg 


Met (M) 


leu; phe; ile 


leu 


Phe (F) 


leu; val; ile; ala; tyr 


leu 


Pro (?) 


ala 


ala 


Ser (S) 


thr 


thr 


Thr (T) 


ser 


ser 


Trp(W) 


tyr, phe 


tyr 


Tyr(Y) 


trp; phe; thr, ser 


phe 


Val (V) 


ile; leu; met; phe; 


leu 




ala; norleucine 





Substantial modifications in function or immunological 
identity of the PR019598 polypeptide are accomplished by 
selecting substitutions that differ significantly in their effect 
on maintaining (a) the structure of the polypeptide backbone 
in the area of the substitution, for example, as a sheet or 
helical conformation, (b) the charge or hydrophobicity of the 
molecule at the target site, or (c) the bulk of the side chain. 
Naturally occurring residues are divided into groups based 
on common side-chain properties: 

(1) hydrophobic: norleucine, met, ala, val, leu, ile; 

(2) neutral hydrophilic: cys, ser, thr; 

(3) acidic: asp, glu; 

(4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr, phe. 

Non-conservative substitutions will entail exchanging a 
member of one of these classes for another class. Such 
substituted residues also may be introduced into the conser- 
vative substitution sites or, more preferably, into the remain- 
ing (non-conserved) sites. 

The variations can be made using methods known in the 
art such as oligonucleotide-mediated (site-directed) 
mutagenesis, alanine scanning, and PCR mutagenesis. Site- 
directed mutagenesis [Carter et aL, Nucl. Acids Res., 
13:4331 (1986); Zoller et aL, Nucl. Acids Res., 10:6487 



10,520 B2 

46 

(1987)], cassette mutagenesis [Wells et aL, Gene, 34:315 
(1985)], restriction selection mutagenesis [Wells et aL, Plu~ 
los. Trans. R. Soc. London SerA, 317:415 (1986)] or other 
known techniques can be performed on the cloned DNA to 

5 produce the PR019598 variant DNA. 

Scanning amino acid analysis can also be employed to 
identify one or more amino acids along a contiguous 
sequence. Among the preferred scanning amino acids are 
relatively small, neutral amino acids. Such amino acids 

10 include alanine, glycine, serine, and cysteine. Alanine is 
typically a preferred scanning amino acid among this group 
because it eliminates the side-chain beyond the beta-carbon 
and is less likely to alter the main-chain conformation of the 
variant [Cunningham and Wells, Science, 244: 1081-1085 

15 (1989)]. Alanine is also typically preferred because it is the 
most common amino acid. Further, it is frequently found in 
both buried and exposed positions [Creighton, The Proteins, 
(W. H. Freeman & Co., N.Y.); Chothia, J. Mol Biol, 150:1 
(1976)]. If alanine substitution does not yield adequate 

20 amounts of variant, an isoteric amino acid can be used. 
C. Modifications of PR019598 

Covalent modifications of PR019598 are included within 
the scope of this invention. One type of covalent modifica- 
tion includes reacting targeted amino acid residues of a 

25 PRO 19598 polypeptide with an organic derivatizing agent 
that is capable of reacting with selected side chains or the N- 
or C-terminal residues of the PR019598. Derivatization 
with bifunctional agents is useful, for instance, for crosslink- 
ing PRO 195 98 to a water-insoluble support matrix or sur- 

30 face for use in the method for purifying anti-PR019598 
antibodies, and vice-versa. Commonly used crosslinking 
agents include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, 
glutaraldehyde, N-hydroxysuccinimade esters, for example, 
esters with 4-azidosalicylic acid, homobifunctional 

35 imidoesters, including disuccinimidyl esters such as 3,3'- 
dithiobis(succinimidylpropionate), bifunctional maleimides 
such as bis-N-maleimido-l,8-octane and agents such as 
methyl-3-[(p-azidophenyl)dithio]propioimidate. 

Other modifications include deamidation of glutaminyl 

40 and asparaginyl residues to the corresponding glutamyl and 
aspartyl residues, respectively, hydroxylation of proline and 
lysine, phosphorylation of hydroxyl groups of seryl or 
threonyl residues, methylation of the a-amino groups of 
lysine, arginine, and histidine side chains [T. E. Creighton, 

45 Proteins: Structure and Molecular Properties, W. H. Free- 
man & Co., San Francisco, pp. 79-86 (1983)], acetylation of 
the N-tenninal amine, and amidation of any C-terminal 
carboxyl group. 

Another type of covalent modification of the PRO 19598 

50 polypeptide included within the scope of this invention 
comprises altering the native glycosylation pattern of the 
polypeptide. "Altering the native glycosylation pattern" is 
intended for purposes herein to mean deleting one or more 
carbohydrate moieties found in native sequence PRO 19598 

55 (either by removing the underlying glycosylation site or by 
deleting the glycosylation by chemical and/or enzymatic 
means), and/or adding one or more glycosylation sites that 
are not present in the native sequence PR019598. In 
addition, the phrase includes qualitative changes in the 

60 glycosylation of the native proteins, involving a change in 
the nature and proportions of the various carbohydrate 
moieties present. 

Addition of glycosylation sites to the PRO 19598 polypep- 
tide may be accomplished by altering the amino acid 

65 sequence. The alteration may be made, for example, by the 
addition of, or substitution by, one or more serine or threo- 
nine residues to the native sequence PRO 19598 (for 
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O-linked glycosylation sites). The PR019598 amino acid 
sequence may optionally be altered through changes at the 
DNA level, particularly by mutating the DNA encoding the 
PR019598 polypeptide at preselected bases such that 
codons are generated that will translate into the desired 
amino acids. 

Another means of increasing the number of carbohydrate 
moieties on the PRO 19598 polypeptide is by chemical or 
enzymatic coupling of glycosides to the polypeptide. Such 
methods are described in the art, e.g., in WO 87/05330 
published Sep. 11, 1987, and in Aplin and Wriston, CRC 
Crit. Rev. Biochem., pp. 259-306 (1981). 

Removal of carbohydrate moieties present on the 
PRO 19598 polypeptide may be accomplished chemically or 
enzymatically or by mutational substitution of codons 
encoding for amino acid residues that serve as targets for 
glycosylation. Chemical deglycosylation techniques are 
known in the art and described, for instance, by 
Hakimuddin, et *L,Arck Biochem. Biophys., 259:52 (1987) 
and by Edge et *L,AnaL Biochem., 118:131 (1981). Enzy- 
matic cleavage of carbohydrate moieties on polypeptides 
can be achieved by the use of a variety of endo- and 
exo-glycosidases as described by Thotakura et al., Meth. 
EnzymoL, 138:350 (1987). 

Another type of covalent modification of PRO 19598 
comprises linking the PRO 19598 polypeptide to one of a 
variety of nonproteinaceous polymers, e.g., polyethylene 
glycol (PEG), polypropylene glycol, or polyoxyalkylenes, in 
the manner set forth in U.S. Pat. Nos. 4,640,835; 4,496,689; 
4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

The PRO 19598 of the present invention may also be 
modified in a way to form a chimeric molecule comprising 
PRO 195 98 fused to another, heterologous polypeptide or 
amino acid sequence. 

In one embodiment, such a chimeric molecule comprises 
a fusion of the PRO 19598 with a tag polypeptide which 
provides an epitope to which an anti-tag antibody can 
selectively bind. The epitope tag is generally placed at the 
amino- or carboxyl-terminus of the PRO 19598. The pres- 
ence of such epitope-tagged forms of the PR019598 can be 
detected using an antibody against the tag polypeptide. Also, 
provision of the epitope tag enables the PRO 19598 to be 
readily purified by affinity purification using an anti-tag 
antibody or another type of affinity matrix that binds to the 
epitope tag. Various tag polypeptides and their respective 
antibodies are well known in the art. Examples include 
poly-histidine (poly-his) or poly-histidine-glycine (poly-his- 
gly) tags; the flu HA tag polypeptide and its antibody 12CA5 
[Field et al., Mol Cell. Biol., 8:2159-2165 (1988)]; the 
c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 
antibodies thereto [Evan et al., Molecular and Cellular 
Biology, 5:3610-3616 (1985)]; and the Herpes Simplex 
virus glycoprotein D (gD) tag and its antibody [Paborsky et 
al., Protein Engineering, 3(6):547-553 (1990)]. Other tag 
polypeptides include the Flag-peptide [Hopp et al., 
BioTecftnology, 6:1204-1210 (1988)]; the KT3 epitope pep- 
tide [Martin et al., Science, 255:192-194 (1992)]; an 
a-tubulin epitope peptide [Skinner et al., J. Biol. Chem., 
266:15163-15166 (1991)]; and the T7 gene 10 protein 
peptide tag [Lutz-Freyermuth et al., Proc. Natl. Acad. Sci. 
USA, 87:6393-6397 (1990)]. 

In an alternative embodiment, the chimeric molecule may 
comprise a fusion of the PR019598 with an immunoglobu- 
lin or a particular region of an immunoglobulin. For a 
bivalent form of the chimeric molecule (also referred to as 
an "immunoadhesin"), such a fusion could be to the Fc 
region of an IgG molecule. The Ig fusions preferably include 



the substitution of a soluble (transmembrane domain deleted 
or inactivated) form of a PRO 19598 polypeptide in place of 
at least one variable region within an Ig molecule. In a 
particularly preferred embodiment, the immunoglobulin 

5 fusion includes the hinge, CH2 and CH3, or the hinge, CHI, 
CH2 and CH3 regions of an IgGl molecule. For the pro- 
duction of immunoglobulin fusions see also U.S. Pat. No. 
5,428,130 issued Jun. 27, 1995. 
D. Preparation of PR019598 

The description below relates primarily to production of 
PRO 19598 by culturing cells transformed or transfected 
with a vector containing PRO 19598 nucleic acid. It is, of 
course, contemplated that alternative methods, which are 
well known in the art, may be employed to prepare 
PR019598. For instance, the PR019598 sequence, or por- 

15 tions thereof, may be produced by direct peptide synthesis 
using solid-phase techniques [see, e.g., Stewart et al., Solid- 
Phase Peptide Synthesis, W. H. Freeman Co., San Francisco, 
Calif. (1969); Merrifield, J. Am. Chem. Soc., 85:2149-2154 
(1963)]. In vitro protein synthesis may be performed using 

20 manual techniques or by automation. Automated synthesis 
may be accomplished, for instance, using an Applied Bio- 
systems Peptide Synthesizer (Foster City, Calif.) using 
manufacturer's instructions. Various portions of the 
PR019598 may be chemically synthesized separately and 

25 combined using chemical or enzymatic methods to produce 
the full-length PR019598. 
1. Isolation of DNA Encoding PR019598 

DNA encoding PR019598 may be obtained from a cDNA 
library prepared from tissue believed to possess the 

30 PR019598 mRNA and to express it at a detectable level. 
Accordingly, human PR019598 DNA can be conveniently 
obtained from a cDNA library prepared from human tissue, 
such as described in the Examples. The PRO 19598- 
encoding gene may also be obtained from a genomic library 

35 or by known synthetic procedures (e.g., automated nucleic 
acid synthesis). 

Libraries can be screened with probes (such as antibodies 
to the PR019598 or oligonucleotides of at least about 20-80 
bases) designed to identify the gene of interest or the protein 

40 encoded by it. Screening the cDNAor genomic library with 
the selected probe may be conducted using standard 
procedures, such as described in Sambrook et al., Molecular 
Cloning: A Laboratory Manual (New York: Cold Spring 
Harbor Laboratory Press, 1989). An alternative means to 

45 isolate the gene encoding PRO 19598 is to use PCR meth- 
odology [Sambrook et al., supra; Dieffenbach et al., PCR 
Primer: A Laboratory Manual (Cold Spring Harbor Labo- 
ratory Press, 1995)]. 

The Examples below describe techniques for screening a 

50 cDNA library. The oligonucleotide sequences selected as 
probes should be of sufficient length and sufficiently unam- 
biguous that false positives are minimized. The oligonucle- 
otide is preferably labeled such that it can be detected upon 
hybridization to DNA in the library being screened. Methods 

55 of labeling are well known in the art, and include the use of 
radiolabels like 32 P-labeled ATP, biotinylation or enzyme 
labeling. Hybridization conditions, including moderate 
stringency and high stringency, are provided in Sambrook et 
al., supra. 

60 Sequences identified in such library screening methods 
can be compared and aligned to other known sequences 
deposited and available in public databases such as Gen- 
Bank or other private sequence databases. Sequence identity 
(at either the amino acid or nucleotide level) within defined 

65 regions of the molecule or across the full-length sequence 
can be determined using methods known in the art and as 
described herein. 
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Nucleic acid having protein coding sequence may be strain W3110 may be modified to effect a genetic mutation 

obtained by screening selected cDNA or genomic libraries in the genes encoding proteins endogenous to the host, with 

using the deduced amino acid sequence disclosed herein for examples of such hosts including E. coli W3110 strain 1A2, 

the first time, and, if necessary, using conventional primer which has the complete genotype tonA ; £. coli W3110 

extension procedures as described in Sambrook et al., supra, 5 slram a£4, which has the complete genotype tonA ptr3; E. 

to detect precursors and processing intermediates of mRNA co \i W3110 strain 27C7 (ATTC 55,244), which has the 

that may not have been reverse-transcribed into cDNA. complete genotype tonA ptr3 phoA E15 (argF-lac)169 degP 

2. Selection and Transformation of Host Cells ompT j^r. £ coli W3n0 strain 37D6> which has the 

Host cells are transfected or trar^formed^th expression ^ tQnA 3 hoA £15 ( F _ lac)169 degP 

or cloning vectors described herein for PR019598 produc- 10 ^ ^ G kaQ r £ cqU W3n0 straifl ^4, which is 

tion and cultured m conventional nutrient media modified as 3?D6 ^ a non _^ ciQ resistant de ^ deletion 

appropnate for inducing promoters, selecting transformants, . , „ .. A . ' . - i 

or^mjlifyingthegen^ mu at,on, and an £. coh strair .having mutant penplasmic 

culture conditions! such as media, temperature, pH and the P?*^^ 1 ^ 1D U S : PaL N ?" ****** . lSSUed Au f^' 

like, can be selected by the skilled artisan without undue is 1990 -Alternatively ,m vitro methods of clomng, e.g PCR 

experimentation. In general, principles, protocols, and prac- or olher nucIeic acid Polymerase reactions, are suitable, 
tical techniques for maximizing the productivity of cell In addition to prokaryotes, eukaryotic microbes such as 

cultures can be found in Mammalian Cell Biotechnology: a filamentous fungi or yeast are suitable cloning or expression 

Practical Approach, M. Butler, ed. (IRL Press, 1991) and hosts for PR019598-encoding vectors. Saccharomyces cer- 

Sambrook et al., supra. 20 evisiae is a commonly used lower eukaryotic host microor- 

Methods of eukaryotic cell transfection and prokaryotic ganism. Others include Schizosaccliaromyces pombe (Beach 

cell transformation are known to the ordinarily skilled and Nurse, Nature, 290:140 [1981]; EP 139,383 published 

artisan, for example, CaCL>, CaP0 4 , liposome-mediated and May 2, 1985); Kluyveromyces hosts (U.S. Pat. No. 4,943, 

eiectroporation. Depending on the host cell used, transfer- 529; Fleer et al., Bio/Technology, 9:968-975 (1991)) such as, 

mation is performed using standard techniques appropriate 25 e.g., K. lactis (MW98-8C, CBS683, CBS4574; Louvencourt 

to such cells. The calcium treatment employing calcium et al., /. Bacterial, 154(2): 737-742 [1983]), K. fragilis 

chloride, as described in Sambrook et al., supra, or elec- (ATCC 12,424), K. bulgaricus (ATCC 16,045), K. wick- 

troporation is generally used for prokaryotes. Infection with eramii (ATCC 24,178), K. waltii (ATCC 56,500), K. droso- 

Agrobacterium tumefaciens is used for transformation of philarum (ATCC 36,906; Van den Berg et al., Biol 

certain plant cells, as described by Shaw et al., Gene, 23:315 30 Technology, 8:135 (1990)), K. thermotolerans, and K. 

(1983) and WO 89/05859 published Jun. 29, 1989. For marxianus; yarrowia (EP 402,226); Pichia pastoris (EP 

mammalian cells without such cell walls, the calcium phos- 183,070; Sreekrishna et al, J. Basic Microbiol, 28:265-278 

phate precipitation method of Graham and van der Eb, [1988]); Candida; Trichodenna reesia (EP 244,234); Neu- 

Virology, 52:456-457 (1978) can be employed. General rospora crassa (Case et al., Proc. Natl Acad. ScL USA, 

aspects of mammalian cell host system transfections have 35 76:5259-5263 [1979]); Schwanniomyces such as Schwan- 

been described in U.S. Pat. No. 4,399,216. Transformations niomyces occidental^ (EP 394,538 published Oct. 31, 

into yeast are typically carried out according to the method 1990); and filamentous fungi such as, e.g., Neurospora, 

of Van Solingen et al.,/. Bact., 130:946 (1977) and Hsiao et Penicillium, Tolypocladium (WO 91/00357 published Jan. 

Proc. Natl. Acad. ScL (USA), 76:3829 (1979). However, 10, 1991), and Aspergillus hosts such as A. nidulans 

other methods for introducing DNA into cells, such as by 40 (Ballance et al., Biochem. Biophys. Res. Commun., 

nuclear microinjection, eiectroporation, bacterial protoplast 112:284-289 [1983]; Tilburn et al., Gene, 26:205-221 

fusion with intact cells, or polycations, e.g., polybrene, [1983]; Yeltonetal., Proc. Natl Acad. Sci. USA, 81: 

polyormthine, may also be used. For various techniques for 1470-1474 [1984]) and A. niger (Kelly and Hynes, EMBO 

traiisformmgmammaUancells,seeKeownetal.,Me^^//j ^ 4:475-479 [1985]). Methylotropic yeasts are suitable 

Enzymology, 185:527-537 (1990) and Mansour et al., 45 herein and include, but are not limited to, yeast capable of 

Nature, 336:348-352 (1988). growth on methanol selected from the genera consisting of 

Suitable host cells for cloning or expressing the DNA in Hansenula, Candida, Kloeckera, Pichia, Saccharomyces, 

the vectors herein include prokaryote, yeast, or higher Torulopsis, and Rhodotorula. A list of specific species that 

eukaryote cells. Suitable prokaryotes include but are not are exemplary of this class of yeasts may be found in C. 

limited to eubacteria, such as Gram-negative or Gram- 50 Anthony, The Biochemistry of Methylotroptis, 269 (1982). 
positive organisms, for example, Enterobacteriaceae such as Suitable host cells for the expression of glycosylated 

E. coli. Various E. coli strains are publicly available, such as PR019598 are derived from multicellular organisms. 

E. coli K12 strain MM294 (ATCC 31,446); E. coli X1776 Examples of invertebrate cells include insect cells such as 

(ATCC 31,537); E. coli strain W3110 (ATCC 27,325) and Drosophila S2 and Spodoptera Sf9, as well as plant cells. 

K5 772 (ATCC 53,635), Other suitable prokaryotic host 55 Examples of useful mammalian host cell lines include 

cells include Enterobacteriaceae such as Escherichia, e.g., E. Chinese hamster ovary (CHO) and COS cells. More specific 

coli, Enterobacter, Erwinia, Klebsiella, Proteus, Salmonella, examples include monkey kidney CV1 line transformed by 

e.g., Salmonella typhimurium, Serratia, e.g., Serratia SV40 (COS-7, ATCC CRL 1651); human embryonic kidney 

marcescans, and Shigella, as well as Bacilli such as B. line (293 or 293 cells subcloned for growth in suspension 

subtilis and B. licheniformis (e.g., B. licheniformis 41P 60 culture, Graham et al., J. Gen Virol, 36:59 (1977)); Chinese 

disclosed in DD 266,710 published Apr. 12, 1989), hamster ovary cellsADHFR (CHO, Urlaub and Chasin, 

Pseudomonas such as P. aeruginosa, and Streptomyces. Proc. Natl. Acad. ScL USA, 77:4216 (1980)); mouse Sertoli 

These examples are illustrative rather than limiting. Strain cells (TM4, Mather, Biol Reprod, 23:243-251 (1980)); 

W3110 is one particularly preferred host or parent host human lung cells (W138, ATCC CCL 75); human liver cells 

because it is a common host strain for recombinant DNA 65 (Hep G2, HB 8065); and mouse mammary tumor (MMT 

product fermentations. Preferably, the host cell secretes 060562, ATCC CCL51). The selection of the appropriate 

minimal amounts of proteolytic enzymes. For example, host cell is deemed to be within the skill in the art. 
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3. Selection and Use of a Replicable Vector lacking the ability to grow in tryptophan, for example, 

The nucleic acid (e.g., cDNAor genomic DNA) encoding ATCC No. 44076 or PEP4-1 [Jones, Genetics, 85:12 

PR019598 may be inserted into a replicable vector for (1977)]. 

cloning (amplification of the DNA) or for expression. Van- Expression and cloning vectors usually contain a pro- 

ous vectors are publicly available. The vector may, for 5 moter operably linked to the PR019598-encoding nucleic 

example, be in the form of a plasm id, cosmid, viral particle, acid sequence to direct mRNA synthesis. Promoters recog- 

or phage. The appropriate nucleic acid sequence may be nized by a variety of potential host cells are well known, 

inserted into the vector by a variety of procedures. In Promoters suitable for use with prokaryotic hosts include the 

general, DNA is inserted into an appropriate restriction (^-lactamase and lactose promoter systems [Chang et al., 

endonuclease site(s) using techniques known in the art. 10 Nature, 275:615 (1978); Goeddel et al., Nature, 281:544 

Vector components generally include, but are not limited to, (1979)], alkaline phosphatase, a tryptophan (trp) promoter 

one or more of a signal sequence, an origin of replication, system [Goeddel, Nucleic Acids Res., 8:4057 (1980); EP 

one or more marker genes, an enhancer element, a promoter, 36,776], and hybrid promoters such as the tac promoter 

and a transcription termination sequence. Construction of [deBoeret al., Proc. Natl. Acad ScL USA, 80:21-25 (1983)]. 

suitable vectors containing one or more of these components 15 Promoters for use in bacterial systems also will contain a 

employs standard ligation techniques which are known to Shine-Dalgarno (S.D.) sequence operably linked to the DNA 

the skilled artisan. encoding PR019598. 

The PRO 195 98 may be produced recombinantly not only Examples of suitable promoting sequences for use with 

directly, but also as a ftision polypeptide with a heterologous yeast hosts include the promoters for 3-phosphoglycerate 

polypeptide, which may be a signal sequence or other 20 kinase [Hitzeman et al.,/. Biol Chem., 255:2073 (1980)] or 

polypeptide having a specific cleavage site at the N-terminus other glycolytic enzymes [Hess et al., J. Adv. Enzyme Reg., 

of the mature protein or polypeptide. In general, the signal 7:149 (1968); Holland, Biochemistry, 17:4900 (1978)], such 

sequence may be a component of the vector, or it may be a as enolase, glyceraldehyde-3-phosphate dehydrogenase, 

part of the PR019598-encoding DNA that is inserted into hexokinase, pyruvate decarboxylase, phosphofructokinase, 

the vector. The signal sequence may be a prokaryotic signal 25 glucose-6-phosphate isomerase, 3-phosphoglycerate 

sequence selected, for example, from the group of the mutase, pyruvate kinase, triosephosphate isomerase, phos- 

alkaline phosphatase, penicillinase, lpp, or heat-stable phoglucose isomerase, and glucokinase. 

enterotoxin II leaders. For yeast secretion the signal Other yeast promoters, which are inducible promoters 

sequence may be, e.g., the yeast invertase leader, alpha having the additional advantage of transcription controlled 

factor leader (including Saccharomyces and Kluyveromyces 30 by growth conditions, are the promoter regions for alcohol 

a-factor leaders, the latter described in U.S. Pat. No. 5,010, dehydrogenase 2, isocytochrome C, acid phosphatase, deg- 

182), or acid phosphatase leader, the C. albicans glucoamy- radative enzymes associated with nitrogen metabolism, 

lase leader (EP 362,179 published Apr. 4, 1990), or the metallothionein, glyceraldehyde-3-phosphate 

signal described in WO 90/13646 published Nov. 15, 1990. dehydrogenase, and enzymes responsible for maltose and 

In mammalian cell expression, mammalian signal sequences 35 galactose utilization. Suitable vectors and promoters for use 

may be used to direct secretion of the protein, such as signal in yeast expression are further described in EP 73,657. 

sequences from secreted polypeptides of the same or related PRO 19598 transcription from vectors in mammalian host 

species, as well as viral secretory leaders. cells is controlled, for example, by promoters obtained from 

Both expression and cloning vectors contain a nucleic the genomes of viruses such as polyoma virus, fowlpox virus 

acid sequence that enables the vector to replicate in one or 40 (UK 2,211,504 published Jul. 5, 1989), adenovirus (such as 

more selected host cells. Such sequences are well known for Adenovirus 2), bovine papilloma virus, avian sarcoma virus, 

a variety of bacteria, yeast, and viruses. The origin of cytomegalovirus, a retrovirus, hepatitis-B virus and Simian 

replication from the plasmid pBR322 is suitable for most Virus 40 (SV40), from heterologous mammalian promoters, 

Gram-negative bacteria, the 2 //plasmid origin is suitable for e.g., the actin promoter or an inmmunoglobulin promoter, 

yeast, and various viral origins (SV40, polyoma, adenovirus, 45 and from heat -shock promoters, provided such promoters 

VSV or BPV) are useful for cloning vectors in mammalian are compatible with the host cell systems, 

cells. Transcription of a DNA encoding the PR019598 by 

Expression and cloning vectors will typically contain a higher eukaryotes may be increased by inserting an enhancer 

selection gene, also termed a selectable marker. Typical sequence into the vector. Enhancers are cis-acting elements 

selection genes encode proteins that (a) confer resistance to 50 of DNA, usually about from 10 to 300 bp, that act on a 

antibiotics or other toxins, e.g., ampicillin, neomycin, promoter to increase its transcription. Many enhancer 

methotrexate, or tetracycline, (b) complement auxotrophic sequences are now known from mammalian genes (glob in, 

deficiencies, or (c) supply critical nutrients not available elastase, albumin, a-fetoprotein, and insulin). Typically, 

from complex media, e.g., the gene encoding D-alanine however, one will use an enhancer from a eukaryotic cell 

racemase for Bacilli. 55 virus. Examples include the SV40 enhancer on the late side 

An example of suitable selectable markers for mammalian of the replication origin (bp 100-270), the cytomegalovirus 

cells are those that enable the identification of cells compe- early promoter enhancer, the polyoma enhancer on the late 

tent to take up the PR019598-encoding nucleic acid, such as side of the replication origin, and adenovirus enhancers. The 

DHFR or thymidine kinase. An appropriate host cell when enhancer may be spliced into the vector at a position 5' or 3' 

wild-type DHFR is employed is the CHO cell line deficient 60 to the PRO 19598 coding sequence, but is preferably located 

in DHFR activity, prepared and propagated as described by at a site 5' from the promoter. 

Urlaub et a\.,Proc. Natl. Acad Sci. USA, 77:4216 (1980). A Expression vectors used in eukaryotic host cells (yeast, 

suitable selection gene for use in yeast is the trpl gene fungi, insect, plant, animal, human, or nucleated cells from 

present in the yeast plasmid YRp7 [Stinchcomb et al., other multicellular organisms) will also contain sequences 

Nature, 282:39 (1979); Kingsmanet al., Gene, 7:141 (1979); 65 necessary for the termination of transcription and for stabi- 

Tschemper et al., Gene, 10:157 (1980)]. The trpl gene lizing the mRNA. Such sequences are commonly available 

provides a selection marker for a mutant strain of yeast from the 5' and, occasionally 3', untranslated regions of 
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eukaryotic or viral DNAs or cDNAs. These regions contain some and gene mapping and in the generation of anti-sense 

nucleotide segments transcribed as polyadenylated frag- RNA and DNA. PRO 19598 nucleic acid will also be useful 

ments in the untranslated portion of the m RNA encoding for the preparation of PRO 19598 polypeptides by the recom- 

PR019598. binant techniques described herein. 

Still other methods, vectors, and host cells suitable for 5 The full-length native sequence PRO 19598 gene, or por- 

adaptation to the synthesis of PRO 19598 in recombinant tions thereof, may be used as hybridization probes for a 

vertebrate cell culture are described in Gething et a\.,Nature, cDNA library to isolate the full-length PRO 19598 cDNA or 

293:620-625 (1981); Mantei et al., Nature, 281:40-46 to isolate still other cDNAs (for instance, those encoding 

(1979) ; EP 117,060; and EP 117,058. naturallyK>ccurring variants of PR019598 or PR019598 

4. Detecting Gene Amplification/Expression 10 from other species) which have a desired sequence identity 
Gene amplification and/or expression may be measured in to the native PR019598 sequence disclosed herein. 

a sample directly, for example, by conventional Southern Optionally, the length of the probes will be about 20 to about 

blotting, Northern blotting to quantitate the transcription of 50 bases. The hybridization probes may be derived from at 

mRNA [Thom as, Pnoc. Natl. Acad. ScL USA, 77:5201-5205 least partially novel regions of the full length native nucle- 

(1980) ], dot blotting (DNA analysis), or in situ 15 otide sequence wherein those regions may be determined 
hybridization, using an appropriately labeled probe, based without undue experimentation or from genomic sequences 
on the sequences provided herein. Alternatively, antibodies including promoters, enhancer elements and introns of 
may be employed that can recognize specific duplexes, native sequence PRO 19598. By way of example, a screening 
including DNA duplexes, RNA duplexes, and DNA-RNA method will comprise isolating the coding region of the 
hybrid duplexes or DNA-protein duplexes. The antibodies in 20 PRO 19598 gene using the known DNA sequence to synthe- 
turn may be labeled and the assay may be carried out where size a selected probe of about 40 bases. Hybridization probes 
the duplex is bound to a surface, so that upon the formation may be labeled by a variety of labels, including radionucle- 
of duplex on the surface, the presence of antibody bound to otides such as 32 P or 35 S, or enzymatic labels such as 
the duplex can be detected. alkaline phosphatase coupled to the probe via avidin/biotin 

Gene expression, alternatively, may be measured by 25 coupling systems. Labeled probes having a sequence 

immunological methods, such as immunohistochemical complementary to that of the PRO 195 98 gene of the present 

staining of cells or tissue sections and assay of cell culture invention can be used to screen libraries of human cDNA, 

or body fluids, to quantitate directly the expression of gene genomic DNA or mRNA to determine which members of 

product. Antibodies useful for immunohistochemical stain- such libraries the probe hybridizes to. Hybridization tech- 

ing and/or assay of sample fluids may be either monoclonal 30 niques are described in further detail in the Examples below, 

or polyclonal, and may be prepared in any mammal. Any EST sequences disclosed in the present application 

Conveniently, the antibodies may be prepared against a may similarly be employed as probes, using the methods 

native sequence PR019598 polypeptide or against a syn- disclosed herein. 

thetic peptide based on the DNA sequences provided herein Other useful fragments of the PR019598 nucleic acids 

or against exogenous sequence fused to PR019598 DNA 35 include antisense or sense oligonucleotides comprising a 

and encoding a specific antibody epitope. singe-stranded nucleic acid sequence (either RNA or DNA) 

5. Purification of Polypeptide capable of binding to target PR019598 mRNA (sense) or 
Forms of PR019598 may be recovered from culture PR019598 DNA (antisense) sequences. Antisense or sense 

medium or from host cell lysates. If membrane-bound, it can oligonucleotides, according to the present invention, com- 
be released from the membrane using a suitable detergent 40 prise a fragment of the coding region of PR019598 DNA. 
solution (e.g. Triton-X 100) or by enzymatic cleavage. Cells Such a fragment generally comprises at least about 14 
employed in expression of PRO 19598 can be disrupted by nucleotides, preferably from about 14 to 30 nucleotides. The 
various physical or chemical means, such as freeze-thaw ability to derive an antisense or a sense oligonucleotide, 
cycling, sonication, mechanical disruption, or cell lysing based upon a cDNA sequence encoding a given protein is 
agents. 45 described in, for example, Stein and Cohen (Cancer Res. 

It may be desired to purify PR019598 from recombinant 48:2659, 1988) and van der Krol et al. (BioTechniques 

cell proteins or polypeptides. The following procedures are 6:958, 1988). 

exemplary of suitable purification procedures: by fraction- Binding of antisense or sense oligonucleotides to target 
ation on an ion-exchange column; ethanol precipitation; nucleic acid sequences results in the formation of duplexes 
reverse phase HPLC; chromatography on silica or on a 50 that block transcription or translation of the target sequence 
cation-exchange resin such as DEAE; chromatofocusing; by one of several means, including enhanced degradation of 
SDS-PAGE; ammonium sulfate precipitation; gel filtration the duplexes, premature termination of transcription or 
using, for example, Sephadex G-75; protein A Sepharose translation, or by other means. The antisense oligonucle- 
columns to remove contaminants such as IgG; and metal otides thus may be used to block expression of PRO 195 98 
chelating columns to bind epitope-tagged forms of the 55 proteins. Antisense or sense oligonucleotides further com- 
PROl 9598. Various methods of protein purification may be prise oligonucleotides having modified sugar- 
employed and such methods are known in the art and phosphodiester backbones (or other sugar linkages, such as 
described for example in Deutscher, Methods in those described in WO 91/06629) and wherein such sugar 
Enzymology, 182 (1990); Scopes, Protein Purification: Prin- linkages are resistant to endogenous nucleases. Such oligo- 
ciples and Practice, Springer- Verlag, New York (1982). The 60 nucleotides with resistant sugar linkages are stable in vivo 
purification step(s) selected will depend, for example, on the (i.e., capable of resisting enzymatic degradation) but retain 
nature of the production process used and the particular sequence specificity to be able to bind to target nucleotide 
PRO 19598 produced. sequences. 

E. Uses for PRO 19598 Other examples of sense or antisense oligonucleotides 

Nucleotide sequences (or their complement) encoding 65 include those oligonucleotides which are covalently linked 

PRO 19598 have various applications in the art of molecular to organic moieties, such as those described in WO 

biology, including uses as hybridization probes, in chromo- 90/10048, and other moieties that increases affinity of the 
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oligonucleotide for a target nucleic acid sequence, such as to screen for peptide or small molecule inhibitors or agonists 
poly-(L-lysine). Further still, intercalating agents, such as of the binding interaction. Also, the receptor PR019598 can 
ellipticine, and alkylating agents or metal complexes may be be used to isolate correlative ligand(s). Screening assays can 
attached to sense or antisense oligonucleotides to modify be designed to find lead compounds that mimic the biologj- 
binding specificities of the antisense or sense oligonucle- 5 cal activity of a native PRO 19598 or a receptor for PRO 
otide for the target nucleotide sequence. 19598. Such screening assays will include assays amenable 
Antisense or sense oligonucleotides may be introduced to high -throughput screening of chemical libraries, making 
into a cell containing the target nucleic acid sequence by any them particularly suitable for identifying small molecule 
gene transfer method, including, for example, CaP0 4 - drug candidates. Small molecules contemplated include 
mediated DNA transfection, electroporation, or by using 10 synthetic organic or inorganic compounds. The assays can 
gene transfer vectors such as Epstein-Barr virus. In a pre- be performed in a variety of formats, including protein- 
ferred procedure, an antisense or sense oligonucleotide is protein binding assays, biochemical screening assays, 
inserted into a suitable retroviral vector. A cell containing the immunoassays and cell based assays, which are well char- 
target nucleic acid sequence is contacted with the recombi- acterized in the art. 

nant retroviral vector, either in vivo or ex vivo. Suitable 15 Nucleic acids which encode PR019598 or its modified 
retroviral vectors include, but are not limited to, those forms can also be used to generate either transgenic animals 
derived from the murine retrovirus M-MuLV, N2 (a retro- or "knock out" animals which, in turn, are useful in the 
virus derived from M-MuLV), or the double copy vectors development and screening of therapeutically useful 
designated DCT5A, DCT5B and DCT5C (see WO reagents. A transgenic animal (e.g., a mouse or rat) is an 
90/13641). 20 animal having cells that contain a transgene, which trans- 
Sense or antisense oligonucleotides also may be intro- gene was introduced into the animal or an ancestor of the 
duced into a cell containing the target nucleotide sequence animal at a prenatal, e.g., an embryonic stage. A transgene 
by formation of a conjugate with a ligand binding molecule, is a DNA which is integrated into the genome of a cell from 
as described in WO 91/04753. Suitable ligand binding which a transgenic animal develops. In one embodiment, 
molecules include, but are not limited to, cell surface 25 cDNA encoding PR019598 can be used to clone genomic 
receptors, growth factors, other cytokines, or other ligands DNA encoding PRO 19598 in accordance with established 
that bind to cell surface receptors. Preferably, conjugation of techniques and the genomic sequences used to generate 
the ligand binding molecule does not substantially interfere transgenic animals that contain cells which express DNA 
with the ability of the ligand binding molecule to bind to its encoding PRO 195 98. Methods for generating transgenic 
corresponding molecule or receptor, or block entry of the 30 animals, particularly animals such as mice or rats, have 
sense or antisense oligonucleotide or its conjugated version become conventional in the art and are described, for 
into the cell. example, in U.S. Pat. Nos. 4,736,866 and 4,870,009. 

Alternatively, a sense or an antisense oligonucleotide may Typically, particular cells would be targeted for PR019598 

be introduced into a cell containing the target nucleic acid transgene incorporation with tissue-specific enhancers, 

sequence by formation of an oligonucleotide-lipid complex, 35 Transgenic animals that include a copy of a transgene 

as described in WO 90/10448. The sense or antisense encoding PR019598 introduced into the germ line of the 

oligonucleotide -lipid complex is preferably dissociated animal at an embryonic stage can be used to examine the 

within the cell by an endogenous lipase. effect of increased expression of DNA encoding PR019598. 

Antisense or sense RNA or DNA molecules are generally Such animals can be used as tester animals for reagents 

at least about 5 bases in length, about 10 bases in length, 40 thought to confer protection from, for example, pathological 

about 15 bases in length, about 20 bases in length, about 25 conditions associated with its overexpression. In accordance 

bases in length, about 30 bases in length, about 35 bases in with this facet of the invention, an animal is treated with the 

length, about 40 bases in length, about 45 bases in length, reagent and a reduced incidence of the pathological 

about 50 bases in length, about 55 bases in length, about 60 condition, compared to untreated animals bearing the 

bases in length, about 65 bases in length, about 70 bases in 45 transgene, would indicate a potential therapeutic interven- 

length, about 75 bases in length, about 80 bases in length, tion for the pathological condition. 

about 85 bases in length, about 90 bases in length, about 95 Alternatively, non-human homologues of PR019598 can 

bases in length, about 100 bases in length, or more. be used to construct a PRO 195 98 "knock out" animal which 

The probes may also be employed in PCR techniques to has a defective or altered gene encoding PRO 195 98 as a 

generate a pool of sequences for identification of closely 50 result of homologous recombination between the endog- 

related PRO 19598 coding sequences. enous gene encoding PRO 19598 and altered genomic DNA 

Nucleotide sequences encoding a PRO 195 98 can also be encoding PRO 19598 introduced into an embryonic stem cell 

used to construct hybridization probes for mapping the gene of the animal. For example, cDNA encoding PRO 19598 can 

which encodes that PRO 19598 and for the genetic analysis be used to clone genomic DNA encoding PRO 19598 in 

of individuals with genetic disorders. The nucleotide 55 accordance with established techniques. A portion of the 

sequences provided herein may be mapped to a chromosome genomic DNA encoding PRO 19598 can be deleted or 

and specific regions of a chromosome using known replaced with another gene, such as a gene encoding a 

techniques, such as in situ hybridization, linkage analysis selectable marker which can be used to monitor integration, 

against known chromosomal markers, and hybridization Typically, several kilobases of unaltered flanking DNA (both 

screening with libraries. 60 at the 5' and 3' ends) are included in the vector [see e.g., 

When the coding sequences for PR019598 encode a Thomas and Capecchi, Cell, 51:503 (1987) for a description 

protein which binds to another protein (example, where the of homologous recombination vectors]. The vector is intro- 

PR019598 is a receptor), the PR019598 can be used in duced into an embryonic stem cell line (e.g., by 

assays to identify the other proteins or molecules involved in electroporation) and cells in which the introduced DNA has 

the binding interaction. By such methods, inhibitors of the 65 homologously recombined with the endogenous DNA are 

receptor/ligand binding interaction can be identified. Pro- selected [see e.g., Li et al., Cell, 69:915 (1992)]. The 

teins involved in such binding interactions can also be used selected cells are then injected into a blastocyst of an animal 
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(e.g., a mouse or rat) to form aggregation chimeras [see e.g., ful for chromosome identification. In this regard, there exists 

Bradley, in Teratocarcinomas and Embryonic Stem Cells: A an ongoing need to identify new chromosome markers, since 

Practical Approach, E. J. Robertson, ed. (IRL, Oxford, relatively few chromosome marking reagents, based upon 

1987), pp. 113-152]. A chimeric embryo can then be actual sequence data are presently available. Each 

implanted into a suitable pseudopregnant female foster 5 PR019598 nucleic acid molecule of the present invention 

animal and the embryo brought to term to create a "knock can be used as a chromosome marker, 

out" animal. Progeny harboring the homologously recom- PR019598 polypeptides and nucleic acid molecules 

bincd DNA in their germ cells can be identified by standard of me m mvention may ^ te used diagnostically for 

techniques and used to breed animals m which all cells of the ^ { wherein me PR019598 polypeptides of the 

ammal contain the homologously recombmed DNA^ Knock. presen / mve 5 ntion may ta differentially expressed in one 

out animals can be characterized tor instance, for their f. , , . f . r .. , . 

ability to defend against certain pathological conditions and Ussue as ^P™* to ™>*er. preferably in a diseased tissue 

for their development of pathological conditions due to <° a normal l f™ of same x *f ue ^ 

absence of the PR019598 polypeptide. PR019598 nucleic acid molecules will find use for gener- 

Nucleic acid encoding the PR019598 polypeptides may atin g P robes for PCR > Northern analysis, Southern analysis 

also be used in gene therapy. In gene therapy applications, 15 a °d Western analysis. 

genes are introduced into cells in order to achieve in vivo The PR019598 polypeptides described herein may also 
synthesis of a therapeutically effective genetic product, for be employed as therapeutic agents. The PR019598 polypep- 
example for replacement of a defective gene. "Gene tides of the present invention can be formulated according to 
therapy" includes both conventional gene therapy where a known methods to prepare pharmaceutically useful 
lasting effect is achieved by a single treatment, and the 20 compositions, whereby the PR019598 product hereof is 
administration of gene therapeutic agents, which involves combined in admixture with a pharmaceutically acceptable 
the one time or repeated administration of a therapeutically carrier vehicle. Therapeutic formulations are prepared for 
effective DNA or mRNA. Antisense RNAs and DNAs can storage by mixing the active ingredient having the desired 
be used as therapeutic agents for blocking the expression of degree of purity with optional physiologically acceptable 
certain genes in vivo. It has already been shown that short 25 carriers, excipients or stabilizers (Remington's Pharmaceu- 
antisense oligonucleotides can be imported into cells where tical Sciences 16th edition, Osol, A. Ed. (1980)), in the form 
they act as inhibitors, despite their low intracellular concen- of lyophilized formulations or aqueous solutions. Accept- 
trations caused by their restricted uptake by the cell mem- able carriers, excipients or stabilizers are nontoxic to recipi- 
brane. (Zamecnik et al., Proc. Natl Acad. ScL USA ents at the dosages and concentrations employed, and 
83:4143-4146 [1986]). The oligonucleotides can be modi- 30 include buffers such as phosphate, citrate and other organic 
fied to enhance their uptake, e.g. by substituting their acids; antioxidants including ascorbic acid; low molecular 
negatively charged phosphodiester groups by uncharged weight (less than about 10 residues) polypeptides; proteins, 
groups. such as serum albumin, gelatin or immunoglobulins; hydro- 
There are a variety of techniques available for introducing philic polymers such as polyvinylpyrrolidone, amino acids 
nucleic acids into viable cells. The techniques vary depend- 35 such as glycine, glutarine, asparagine, arginine or lysine; 
ing upon whether the nucleic acid is transferred into cultured monosaccharides, disaccharides and other carbohydrates 
cells in vitro, or in vivo in the cells of the intended host. including glucose, mannose, or dextrins; chelating agents 
Techniques suitable for the transfer of nucleic acid into such as EDTA; sugar alcohols such as mannitol or sorbitol; 
mammalian cells in vitro include the use of liposomes, salt-forming counterfoils such as sodium; and/or nonionic 
electroporation, microinjection, cell fusion, DEAE-dextran, 40 surfactants such as TWEEN™, PLURONICS™ or PEG. 
the calcium phosphate precipitation method, etc. The cur- The formulations to be used for in vivo administration 
rently preferred in vivo gene transfer techniques include must be sterile. This is readily accomplished by filtration 
transfection with viral (typically retroviral) vectors and viral through sterile filtration membranes, prior to or following 
coat protein-liposome mediated transfection (Dzau et al., lyophilization and reconstitution. 

Trends in Biotechnology 11, 205-210 [1993]). In some 45 Therapeutic compositions herein generally are placed into 

situations it is desirable to provide the nucleic acid source a container having a sterile access port, for example, an 

with an agent that targets the target cells, such as an antibody intravenous solution bag or vial having a stopper pierceable 

specific for a cell surface membrane protein or the target by a hypodermic injection needle. 

cell, a ligand for a receptor on the target cell, etc. Where The route of administration is in accord with known 

liposomes are employed, proteins which bind to a cell 50 methods, e.g. injection or infusion by intravenous, 

surface membrane protein associated with endocytosis may intraperitoneal, intracerebral, intramuscular, intraocular, 

be used for targeting and/or to facilitate uptake, e.g. capsid intraarterial or intralesional routes, topical administration, or 

proteins or fragments thereof tropic for a particular cell type, by sustained release systems. 

antibodies for proteins which undergo internalization in Dosages and desired drug concentrations of pharmaceu- 

cycling, proteins that target intracellular localization and 55 tical compositions of the present invention may vary 

enhance intracellular half-life. The technique of receptor- depending on the particular use envisioned. The determina- 

mediated endocytosis is described, for example, by Wu et tion of the appropriate dosage or route of administration is 

al., J. Biol. Chem. 262, 4429-4432 (1987); and Wagner et well within the skill of an ordinary physician. Animal 

al., Proc. Natl Acad Sci. USA 87, 3410-3414 (1990). For experiments provide reliable guidance for the determination 

review of gene marking and gene therapy protocols see 60 of effective doses for human therapy. Interspecies scaling of 

Anderson et al., Science 256, 808-513 (1992). effective doses can be performed following the principles 

The PR019598 polypeptides described herein may also laid down by Mordenti, J. and Chappell, W. "The use of 

be employed as molecular weight markers for protein elec- interspecies scaling in toxicokinetics" In Toxicokinetics and 

trophoresis purposes and the isolated nucleic acid sequences New Drug Development, Yacobi et al., Eds., Pergamon 

may be used for recombinantly expressing those markers. 65 Press, New York 1989, pp. 42-96. 

The nucleic acid molecules encoding the PR019598 When in vivo administration of a PRO 19598 polypeptide 

polypeptides or fragments thereof described herein are use- or agonist or antagonist thereof is employed, normal dosage 



US 6,740,520 B2 

59 60 

amounts may vary from about 10 ng/kg to up to 100 mg/kg Non-oovalent attachment generally is accomplished by coat- 

of mammal body weight or more per day, preferably about ing the solid surface with a solution of the PRO 195 98 

1 //g/kg/day to 10 mg/kg/day, depending upon the route of polypeptide and drying. Alternatively, an immobilized 

administration. Guidance as to particular dosages and melh- antibody, e.g., a monoclonal antibody, specific for the 

ods of delivery is provided in the literature; see, for example, 5 PR019598 polypeptide to be immobilized can be used to 

U.S. Pat. Nos. 4,657,760; 5,206344; or 5,225,212. It is anchor it to a solid surface. The assay is performed by 

anticipated that different formulations will be effective for adding the non-immobilized component, which may be 

different treatment compounds and different disorders, that labeled by a detectable label, to the immobilized component, 

administration targeting one organ or tissue, for example, e.g., the coated surface containing the anchored component, 

may necessitate delivery in a manner different from that to 10 When the reaction is complete, the non-reacted components 

another organ or tissue. are removed, e.g., by washing, and complexes anchored on 

Where sustained-release administration of a PR019598 the solid surface are detected. When the originally non- 
polypeptide is desired in a formulation with release charac- immobilized component carries a detectable label, the detec- 
teristics suitable for the treatment of any disease or disorder lion of label immobilized on the surface indicates that 
requiring administration of the PR019598 polypeptide, 15 complexing occurred. Where the originally non- 
microencapsulation of the PRO 195 98 polypeptide is con- immobilized component does not carry a label, complexing 
templated. Microencapsulation of recombinant proteins for can be detected, for example, by using a labeled antibody 
sustained release has been successfully performed with specifically binding the immobilized complex, 
human growth hormone (rhGH), interferon- (rhlFN- ), If the candidate compound interacts with but does not 
interleukin-2, and MN rgpl20. Johnson et al., Nat. Med., 20 bind to a particular PRO 19598 polypeptide encoded by a 
2:795-799 (1996); Yasuda, Biomed. Then, 27:1221-1223 gene identified herein, its interaction with that polypeptide 
(1993); Hora et al., Bio/Technology, 8:755-758 (1990); can be assayed by methods well known for detecting 
Cleland, "Design and Production of Single Immunization protein -protein interactions. Such assays include traditional 
Vaccines Using Polylactide Polyglycolide Microsphere approaches, such as, e.g., cross-linking, 
Systems," in Vaccine Design: The Subunit and Adjuvant 25 co-immunoprecipitation, and co-purification through gradi- 
Approach, Powell and Newman, eds, (Plenum Press: New ents or chromatographic columns. In addition, protein- 
York, 1995), pp. 439-462; WO 97/03692, WO 96/40072, protein interactions can be monitored by using a yeast-based 
WO 96/07399; and U.S. Pat. No. 5,654,010. genetic system described by Fields and co-workers (Fields 

The sustained-release formulations of these proteins were and Song, Nature (London), 340:245-246 (1989); Chienet 
developed using poly-lactic-coglycolic acid (PLGA) poly- 30 al., Proc. Natl Acad. Sci. USA, 88:9578-9582 (1991)) as 
mer due to its biocompatibility and wide range of biode- disclosed by Chevray and Nathans, Proc. Natl. Acad. Sci. 
gradable properties. The degradation products of PLGA, USA, 89: 5789-5793 (1991); Many transcriptional 
lactic and glycolic acids, can be cleared quickly within the activators, such as yeast GAL4, consist of two physically 
human body. Moreover, the degradability of this polymer discrete modular domains, one acting as the DNA-binding 
can be adjusted from months to years depending on its 35 domain, the other one functioning as the transcription- 
molecular weight and composition. Lewis, "Controlled activation domain. The yeast expression system described in 
release of bioactive agents from lactide/glycolide polymer," the foregoing publications (generally referred to as the 
in: M. Chasin and R. Langer (Eds.), Biodegradable Poly- "two-hybrid system") takes advantage of this property, and 
mers as Drug Delivery Systems (Marcel Dekker: New York, employs two hybrid proteins, one in which the target protein 
1990), pp. 1-41. 40 is frised to the DNA-binding domain of GAL4, and another, 

This invention encompasses methods of screening com- in which candidate activating proteins are fused to the 
pounds to identify those that mimic the PRO 195 98 polypep- activation domain. The expression of a GALl-lacZ reporter 
tide (agonists) or prevent the effect of the PR019598 gene under control of a GAL4-activated promoter depends 
polypeptide (antagonists). Screening assays for antagonist on reconstitution of GAL4 activity via protein-protein inter- 
drug candidates are designed to identify compounds that 45 action. Colonies containing interacting polypeptides are 
bind or complex with the PRO 195 98 polypeptides encoded detected with a chromogenic substrate for (3-galactosidase. A 
by the genes identified herein, or otherwise interfere with the complete kit (MATCHMAKER™) for identifying protein- 
interaction of the encoded polypeptides with other cellular protein interactions between two specific proteins using the 
proteins. Such screening assays will include assays ame- two-hybrid technique is commercially available from Clon- 
nable to high -throughput screening of chemical libraries, 50 tech. This system can also be extended to map protein 
making them particularly suitable for identifying small domains involved in specific protein interactions as well as 
molecule drug candidates. to pinpoint amino acid residues that are crucial for these 

The assays can be performed in a variety of formats, interactions, 

including protein-protein binding assays, biochemical Compounds that interfere with the interaction of a gene 

screening assays, immunoassays, and cell-based assays, 55 encoding a PRO 19598 polypeptide identified herein and 

which are well characterized in the art. other intra- or extracellular components can be tested as 

All assays for antagonists are common in that they call for follows: usually a reaction mixture is prepared containing 

contacting the drug candidate with a PRO 195 98 polypeptide the product of the gene and the intra- or extracellular 

encoded by a nucleic acid identified herein under conditions component under conditions and for a time allowing for the 

and for a time sufficient to allow these two components to 60 interaction and binding of the two products. To test the 

interact. ability of a candidate compound to inhibit binding, the 

In binding assays, the interaction is binding and the reaction is run in the absence and in the presence of the test 

complex formed can be isolated or detected in the reaction compound. In addition, a placebo may be added to a third 

mixture. In a particular embodiment, the PRO 195 98 reaction mixture, to serve as positive control. The binding 

polypeptide encoded by the gene identified herein or the 65 (complex formation) between the test compound and the 

drug candidate is immobilized on a solid phase, e.g., on a intra- or extracellular component present in the mixture is 

microliter plate, by covalent or non-covalent attachments. monitored as described hereinabove. The formation of a 
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complex io the control reaction(s) but not in the reaction 
mixture containing the test compound indicates that the test 
compound interferes with the interaction of the test com- 
pound and its reaction partner. 

To assay for antagonists, the PR019598 polypeptide may 
be added to a cell along with the compound to be screened 
for a particular activity and the ability of the compound to 
inhibit the activity of interest in the presence of the 
PR019598 polypeptide indicates that the compound is an 
antagonist to the PR019598 polypeptide. Alternatively, 
antagonists may be detected by combining the PR019598 
polypeptide and a potential antagonist with membrane- 
bound PRO 19598 polypeptide receptors or recombinant 
receptors under appropriate conditions for a competitive 
inhibition assay. The PRO 19598 polypeptide can be labeled, 
such as by radioactivity, such that the number of PRO 195 98 
polypeptide molecules bound to the receptor can be used to 
determine the effectiveness of the potential antagonist. The 
gene encoding the receptor can be identified by numerous 
methods known to those of skill in the art, for example, 
ligand panning and FACS sorting. Coligan et al., Current 
Protocols in Immun., 1(2): Chapter 5 (1991). Preferably, 
expression cloning is employed wherein polyadenylated 
RNA is prepared from a cell responsive to the PRO 195 98 
polypeptide and a cDNA library created from this RNA is 
divided into pools and used to transfect COS cells or other 
cells that are not responsive to the PR019598 polypeptide. 
Transfected cells that are grown on glass slides are exposed 
to labeled PR019598 polypeptide. The PR019598 polypep- 
tide can be labeled by a variety of means including iodina- 
tion or inclusion of a recognition site for a site-specific 
protein kinase. Following fixation and incubation, the slides 
are subjected to autoradiographic analysis. Positive pools 
are identified and sub-pools are prepared and re-transfected 
using an interactive sub-pooling and re-screening process, 
eventually yielding a single clone that encodes the putative 
receptor. 

As an alternative approach for receptor identification, 
labeled PRO 195 98 polypeptide can be photoaflSnity-linked 
with cell membrane or extract preparations that express the 
receptor molecule. Cross-linked material is resolved by 
PAGE and exposed to X-ray film. The labeled complex 
containing the receptor can be excised, resolved into peptide 
fragments, and subjected to protein micro-sequencing. The 
amino acid sequence obtained from micro-sequencing 
would be used to design a set of degenerate oligonucleotide 
probes to screen a cDNA library to identify the gene 
encoding the putative receptor. 

In another assay for antagonists, mammalian cells or a 
membrane preparation expressing the receptor would be 
incubated with labeled PRO 195 98 polypeptide in the pres- 
ence of the candidate compound. The ability of the com- 
pound to enhance or block this interaction could then be 
measured. 

More specific examples of potential antagonists include 
an oligonucleotide that binds to the fusions of immunoglo- 
bulin with PRO 195 98 polypeptide, and, in particular, anti- 
bodies including, without limitation, poly- and monoclonal 
antibodies and antibody fragments, single-chain antibodies, 
anti-idiotypic antibodies, and chimeric or humanized ver- 
sions of such antibodies or fragments, as well as human 
antibodies and antibody fragments. Alternatively, a potential 
antagonist may be a closely related protein, for example, a 
mutated form of the PR019598 polypeptide that recognizes 
the receptor but imparts no effect, thereby competitively 
inhibiting the action of the PR019598 polypeptide. 

Another potential PR019598 polypeptide antagonist is an 
antisense RNA or DNA construct prepared using antisense 
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technology, where, e.g., an antisense RNA or DNA molecule 
acts to block directly the translation of mRNA by hybrid- 
izing to targeted mRNA and preventing protein translation. 
Antisense technology can be used to control gene expression 
through triple-helix formation or antisense DNA or RNA, 
both of which methods are based on binding of a polynucle- 
otide to DNA or RNA. For example, the 5* coding portion of 
the polynucleotide sequence, which encodes the mature 
PR019598 polypeptides herein, is used to design an anti- 
sense RNA oligonucleotide of from about 10 to 40 base pairs 
in length. A DNA oligonucleotide is designed to be comple- 
mentary to a region of the gene involved in transcription 
(triple helix— see Lee et al., NucL Acids Res., 6:3073 (1979); 
Cooney et al., Science, 241: 456 (1988); Dervan et al., 
Science, 251:1360 (1991)), thereby preventing transcription 
and the production of the PRO 19598 polypeptide. The 
antisense RNA oligonucleotide hybridizes to the mRNA in 
vivo and blocks translation of the mRNA molecule into the 
PR019598 polypeptide (antisense — Okano, Neurochem., 
56:560 (1991); Oligodeoxvnucleotides as Antisense Initia- 
tors of Gene Expression (CRC Press: Boca Raton, Fla., 
1988). The oligonucleotides described above can also be 
delivered to cells such that the antisense RNA or DNA may 
be expressed in vivo to inhibit production of the PR019598 
polypeptide. When antisense DNA is used, oligodeoxyribo- 
nucleotides derived from the translation-initiation site, e.g., 
between about -10 and +10 positions of the target gene 
nucleotide sequence, are preferred. 

Potential antagonists include small molecules that bind to 
the active site, the receptor binding site, or growth factor or 
other relevant binding site of the PR019598 polypeptide, 
thereby blocking the normal biological activity of the 
PR019598 polypeptide. Examples of small molecules 
include, but are not limited to, small peptides or peptide-like 
molecules, preferably soluble peptides, and synthetic non- 
peptidyl organic or inorganic compounds. 

Ribozymes are enzymatic RNA molecules capable of 
catalyzing the specific cleavage of RNA. Ribozymes act by 
sequence-specific hybridization to the complementary target 
RNA, followed by endonucleolytic cleavage. Specific 
ribozyme cleavage sites within a potential RNA target can be 
identified by known techniques. For further details see, e.g., 
Rossi, Current Biology, 4:469^71 (1994), and PCT publi- 
cation No. WO 97/33551 (published Sep. 18, 1997). 

Nucleic acid molecules in triple-helix formation used to 
inhibit transcription should be single-stranded and com- 
posed of deoxynucleotides. The base composition of these 
oligonucleotides is designed such that it promotes triple - 
helix formation via Hoogsteen base-pairing rules, which 
generally require sizeable stretches of purines or pyrim- 
idines on one strand of a duplex. For further details see, e.g., 
PCT publication No. WO 97/33551, supra. 

These small molecules can be identified by any one or 
more of the screening assays discussed hereinabove and/or 
by any other screening techniques well known for those 
skilled in the art. 

Diagnostic and therapeutic uses of the herein disclosed 
molecules may also be based upon the positive functional 
assay hits disclosed and described below. 
F. Anti-PR019598 Antibodies 

The present invention further provides anti-PR019598 
antibodies. Exemplary antibodies include polyclonal, 
monoclonal, humanized, bispecific, and heteroconjugate 
antibodies. 

1. Polyclonal Antibodies 

The anti-PROl9598 antibodies may comprise polyclonal 
antibodies. Methods of preparing polyclonal antibodies are 
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known to the skilled artisan. Polyclonal antibodies can be 
raised in a mammal, for example, by one or more injections 
of an immunizing agent and, if desired, an adjuvant. 
Typically, the immunizing agent and/or adjuvant will be 
injected in the mammal by multiple subcutaneous or intra- 
peritoneal injections. The immunizing agent may include the 
PR019598 polypeptide or a fusion protein thereof. It may be 
useful to conjugate the immunizing agent to a protein known 
to be immunogenic in the mammal being immunized. 
Examples of such immunogenic proteins include but are not 
limited to keyhole limpet hemocyanin, serum albumin, 
bovine thyroglobulin, and soybean trypsin inhibitor. 
Examples of adjuvants which may be employed include 
Freund's complete adjuvant and MPL-TDM adjuvant 
(monophosphoryl Lipid A, synthetic trehalose 
dicorynomycolate). The immunization protocol may be 
selected by one skilled in the art without undue experimen- 
tation. 

2. Monoclonal Antibodies 

The anti-PR019598 antibodies may, alternatively, be 
monoclonal antibodies. Monoclonal antibodies may be pre- 
pared using hybridoma methods, such as those described by 
Kohler and Milstein, Nature, 256:495 (1975). In a hybri- 
doma method, a mouse, hamster, or other appropriate host 
animal, is typically immunized with an immunizing agent to 
elicit lymphocytes that produce or are capable of producing 
antibodies that will specifically bind to the immunizing 
agent. Alternatively, the lymphocytes may be immunized in 
vitro. 

The immunizing agent will typically include the 
PR019598 polypeptide or a fusion protein thereof. 
Generally, either peripheral blood lymphocytes ("PBLs") 
are used if cells of human origin are desired, or spleen cells 
or lymph node cells are used if non-human mammalian 
sources are desired. The lymphocytes are then fused with an 
immortalized cell line using a suitable fusing agent, such as 
polyethylene glycol, to form a hybridoma cell [Goding, 
Monoclonal Antibodies: Principles and Practice, Academic 
Press, (1986) pp. 59-103]. Immortalized cell lines are usu- 
ally transformed mammalian cells, particularly myeloma 
cells of rodent, bovine and human origin. Usually, rat or 
mouse myeloma cell lines are employed. The hybridoma 
cells may be cultured in a suitable culture medium that 
preferably contains one or more substances that inhibit the 
growth or survival of the unfused, immortalized cells. For 
example, if the parental cells lack the enzyme hypoxanthine 
guanine phosphoribosyl transferase (HGPRT or HPRT), the 
culture medium for the hybridomas typically will include 
hypoxanthine, aminopterin, and thymidine ("HAT 
medium"), which substances prevent the growth of HGPRT- 
deficient cells. 

Preferred immortalized cell lines are those that fuse 
efficiently, support stable high level expression of antibody 
by the selected antibody-producing cells, and are sensitive to 
a medium such as HAT medium. More preferred immortal- 
ized cell lines are murine myeloma lines, which can be 
obtained, for instance, from the Salk Institute Cell Distri- 
bution Center, San Diego, Calif, and the American Type 
Culture Collection, Manassas, Va. Human myeloma and 
mouse-human heteromyeloma cell lines also have been 
described for the production of human monoclonal antibod- 
ies [Kozbor, J, Immunol, 133:3001 (1984); Brodeur et al., 
Monoclonal Antibody Production Techniques and 
Applications, Marcel Dekker, Inc., New York, (1987) pp. 
51-63]. 

The culture medium in which the hybridoma cells are 
cultured can then be assayed for the presence of monoclonal 
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antibodies directed against PR019598. Preferably, the bind- 
ing specificity of monoclonal antibodies produced by the 
hybridoma cells is determined by immunoprecipitation or by 
an in vitro binding assay, such as radioimmunoassay (RIA) 
or enzyme-linked immunoabsorbent assay (ELISA). Such 
techniques and assays are known in the art. The binding 
affinity of the monoclonal antibody can, for example, be 
determined by the Scatchard analysis of Munson and 
Pollard, Anal Biochem., 107:220 (1980). 

After the desired hybridoma cells are identified, the 
clones may be subcloned by limiting dilution procedures and 
grown by standard methods [Goding, supra]. Suitable cul- 
ture media for this purpose include, for example, Dulbecco's 
Modified Eagle's Medium and RPMI-1640 medium. 
Alternatively, the hybridoma cells may be grown in vivo as 
ascites in a mammal. 

The monoclonal antibodies secreted by the subclones may 
be isolated or purified from the culture medium or ascites 
fluid by conventional immunoglobulin purification proce- 
dures such as, for example, protein A-Sepharose, hydroxy- 
lapatite chromatography, gel electrophoresis, dialysis, or 
affinity chromatography. 

The monoclonal antibodies may also be made by recom- 
binant DNA methods, such as those described in U.S. Pat. 
No. 4,816,567. DNA encoding the monoclonal antibodies of 
the invention can be readily isolated and sequenced using 
conventional procedures (e.g., by using oligonucleotide 
probes that are capable of binding specifically to genes 
encoding the heavy and light chains of murine antibodies). 
The hybridoma cells of the invention serve as a preferred 
source of such DNA. Once isolated, the DNA may be placed 
into expression vectors, which are then transfected into host 
cells such as simian COS cells, Chinese hamster ovary 
(CHO) cells, or myeloma cells that do not otherwise produce 
immunoglobulin protein, to obtain the synthesis of mono- 
clonal antibodies in the recombinant host cells. The DNA 
also may be modified, for example, by substituting the 
coding sequence for human heavy and light chain constant 
domains in place of the homologous murine sequences [U.S. 
Pat. No. 4,816,567; Morrison et al., supra] or by covalently 
joining to the immunoglobulin coding sequence all or part of 
the coding sequence for a non -immunoglobulin polypeptide. 
Such a non-inumunoglobulin polypeptide can be substituted 
for the constant domains of an antibody of the invention, or 
can be substituted for the variable domains of one antigen- 
combining site of an antibody of the invention to create a 
chimeric bivalent antibody. 

The antibodies may be monovalent antibodies. Methods 
for preparing monovalent antibodies are well known in the 
art. For example, one method involves recombinant expres- 
sion of immunoglobulin light chain and modified heavy 
chain. The heavy chain is truncated generally at any point in 
the Fc region so as to prevent heavy chain crosslinking. 
Alternatively, the relevant cysteine residues are substituted 
with another amino acid residue or are deleted so as to 
prevent crosslinking. 

In vitro methods are also suitable for preparing monova- 
lent antibodies. Digestion of antibodies to produce frag- 
ments thereof, particularly, Fab fragments, can be accom- 
plished using routine techniques known in the art. 
3. Human and Humanized Antibodies 

The anti-PROl9598 antibodies of the invention may 
further comprise humanized antibodies or human antibodies. 
Humanized forms of non-human (e.g., murine) antibodies 
are chimeric immunoglobulins, immunoglobulin chains or 
fragments thereof (such as Fv, Fab, Fab 1 , F(ab ! ) 2 or other 
antigen-binding subsequences of antibodies) which contain 
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minimal sequence derived from non-human immunoglobu- 
lin. Humanized antibodies include human immunoglobulins 
(recipient antibody) in which residues from a complemen- 
tary determining region (CDR) of the recipient are replaced 
by residues from a CDR of a non-human species (donor 
antibody) such as mouse, rat or rabbit having the desired 
specificity, affinity and capacity. In some instances, Fv 
framework residues of the human immunoglobulin are 
replaced by corresponding non-human residues. Humanized 
antibodies may also comprise residues which are found 
neither in the recipient antibody nor in the imported CDR or 
framework sequences. In general, the humanized antibody 
will comprise substantially all of at least one, and typically 
two, variable domains, in which all or substantially all of the 
CDR regions correspond to those of a non-human immu- 
noglobulin and all or substantially all of the FR regions are 
those of a human immunoglobulin consensus sequence. The 
humanized antibody optimally also will comprise at least a 
portion of an immunoglobulin constant region (Fc), typically 
that of a human immunoglobulin [Jones et al., Nature, 
321:522-525 (1986); Riechmann et al., Nature, 
332:323-329 (1988); and Presta, Curr. Op. Struct. Bio., 
2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well 
known in the art. Generally, a humanized antibody has one 
or more amino acid residues introduced into it from a source 
which is non-human. These non-human amino acid residues 
are often referred to as "import" residues, which are typi- 
cally taken from an "import" variable domain. Humaniza- 
tion can be essentially performed following the method of 
Winter and co-workers [Jones et al., Nature, 321:522-525 
(1986); Riechmann et al., Nature, 332:323-327 (1988); 
Verhoeyen et al., Science, 239: 1534-1536 (1988)], by 
substituting rodent CDRs or CDR sequences for the corre- 
sponding sequences of a human antibody. Accordingly, such 
"humanized" antibodies are chimeric antibodies (U.S. Pat. 
No. 4,816,567), wherein substantially less than an intact 
human variable domain has been substituted by the corre- 
sponding sequence from a non -human species. In practice, 
humanized antibodies are typically human antibodies in 
which some CDR residues and possibly some FR residues 
are substituted by residues from analogous sites in rodent 
antibodies. 

Human antibodies can also be produced using various 
techniques known in the art, including phage display librar- 
ies [Hoogenboom and Winter,/. Mol Biol, 227:381 (1991); 
Marks et al.,/. Mol Biol, 222:581 (1991)]. The techniques 
of Cole et al. and Boerner et al. are also available for the 
preparation of human monoclonal antibodies (Cole et al., 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, 
p. 77 (1985) and Boerner et al., J. Invnunol, 147(l):86-95 
(1991)]. Similarly, human antibodies can be made by intro- 
ducing of human immunoglobulin loci into transgenic 
animals, e.g., mice in which the endogenous immunoglo- 
bulin genes have been partially or completely inactivated. 
Upon challenge, human antibody production is observed, 
which closely resembles that seen in humans in all respects, 
including gene rearrangement, assembly, and antibody rep- 
ertoire. This approach is described, for example, in U.S. Pat. 
Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633, 
425; 5,661,016, and in the following scientific publications: 
Marks et al., Bio/Technology 10, 779-783 (1992); Lonberg 
et al., Nature 368 856-859 (1994); Morrison, Nature 368, 
812-13 (1994); Fishwild et al., Nature Biotechnology 14, 
845-51 (1996); Neuberger, Nature Biotechnology 14, 826 
(1996); Lonberg and Huszar, Intern. Rev. Immunol 13 
65-93 (1995). 



The antibodies may also be affinity matured using known 
selection and/or mutagenesis methods as described above. 
Preferred affinity matured antibodies have an affinity which 
is five times, more preferably 10 times, even more prefer- 

5 ably 20 or 30 times greater than the starting antibody 
(generally murine, humanized or human) from which the 
matured antibody is prepared. 
4. Bispecific Antibodies 

Bispecific antibodies are monoclonal, preferably human 

10 or humanized, antibodies that have binding specificities for 
at least two different antigens. In the present case, one of the 
binding specificities is for the PRO 19598, the other one is 
for any other antigen, and preferably for a cell-surface 
protein or receptor or receptor subunit. 

15 Methods for making bispecific antibodies are known in 
the art. Traditionally, the recombinant production of bispe- 
cific antibodies is based on the co-expression of two immu- 
noglobulin heavy-chain/light-chain pairs, where the two 
heavy chains have different specificities [Milstein and 

20 Cuello, Nature, 305:537-539 (1983)]. Because of the ran- 
dom assortment of immunoglobulin heavy and light chains, 
these hybridomas (quadromas) produce a potential mixture 
of ten different antibody molecules, of which only one has 
the correct bispecific structure. The purification of the cor- 

25 rect molecule is usually accomplished by affinity chroma- 
tography steps. Similar procedures are disclosed in WO 
93/08829, published May 13, 1993, and in Traunecker et al., 
EMBOJ., 10:3655-3659 (1991). 

Antibody variable domains with the desired binding 

30 specificities (antibody-antigen combining sites) can be fused 
to immunoglobulin constant domain sequences. The fusion 
preferably is with an immunoglobulin heavy-chain constant 
domain, comprising at least part of the hinge, CH2, and CH3 
regions. It is preferred to have the first heavy-chain constant 

35 region (CHI) containing the site necessary for light-chain 
binding present in at least one of the fusions. DNAs encod- 
ing the immunoglobulin heavy-chain fusions and, if desired, 
the immunoglobulin light chain, are inserted into separate 
expression vectors, and are co-transfected into a suitable 

40 host organism. For further details of generating bispecific 
antibodies see, for example, Suresh et al., Methods in 
Enzymology, 121:210 (1986). 

According to another approach described in WO 
96/27011, the interface between a pair of antibody mol- 

45 ecules can be engineered to maximize the percentage of 
heterodimers which are recovered from recombinant cell 
culture. The preferred interface comprises at least a part of 
the CH3 region of an antibody constant domain. In this 
method, one or more small amino acid side chains from the 

50 interface of the first antibody molecule are replaced with 
larger side chains (e.g. tyrosine or tryptophan). Compensa- 
tory "cavities" of identical or similar size to the large side 
chain(s) are created on the interface of the second antibody 
molecule by replacing large amino acid side chains with 

55 smaller ones (e.g. alanine or threonine). This provides a 
mechanism for increasing the yield of the heterodimer over 
other unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as mil length anti- 
bodies or antibody fragments (e.g. F(ab') 2 bispecific 

60 antibodies). Techniques for generating bispecific antibodies 
from antibody fragments have been described in the litera- 
ture. For example, bispecific antibodies can be prepared can 
be prepared using chemical linkage. Brennan et al., Science 
229:81 (1985) describe a procedure wherein intact antibod- 

65 ies are proteolytically cleaved to generate F(ab') 2 fragments. 
These fragments are reduced in the presence of the dithiol 
complexing agent sodium arsenite to stabilize vicinal dithi- 
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ols and prevent intermolecular disulfide formation. The Fab' treatment of HIV infection [WO 91/00360; WO 92/200373; 

fragments generated are then converted to thionitrobenzoate EP 03089]. It is contemplated that the antibodies may be 

(TNB) derivatives. One of the Fab'-TNB derivatives is then prepared in vitro using known methods in synthetic protein 

reconverted to the Fab'-thiol by reduction with mercaptoet- chemistry, including those involving crosslinking agents, 

hylamine and is mixed with an equimolar amount of the 5 For example, immunotoxins may be constructed using a 

other Fab'-TNB derivative to form the bispecific antibody. disulfide exchange reaction or by forming a thioether bond. 

The bispecific antibodies produced can be used as agents for Examples of suitable reagents for this purpose include 

the selective immobilization of enzymes. iminothiolate and metbyl-4-mercaptobutyrimidate and those 

Fab 1 fragments may be directly recovered from£. coli and disclosed, for example, in U.S. Pat. No. 4,676,980. 

chemically coupled to form bispecific antibodies. Shalaby et 10 6. Effector Function Engineering 

al., J. Exp. Med. 175:217-225 (1992) describe the produc- It may be desirable to modify the antibody of the inven- 
tion of a fully humanized bispecific antibody F(ab') 2 mol- tion with respect to effector function, so as to enhance, e.g., 
ecule. Each Fab' fragment was separately secreted from E. the effectiveness of the antibody in treating cancer. For 
coli and subjected to directed chemical coupling in vitro to example, cysteine residues) may be introduced into the Fc 
form the bispecific antibody. The bispecific antibody thus 15 region, thereby allowing interchain disulfide bond formation 
formed was able to bind to cells overexpressing the ErbB2 in this region. The homodimeric antibody thus generated 
receptor and normal human T cells, as well as trigger the may have improved internalization capability and/or 
lytic activity of human cytotoxic lymphocytes against increased complement-mediated cell killing and antibody- 
human breast tumor targets. dependent cellular cytotoxicity (ADCC). See Caron et al.,7. 

Various technique for making and isolating bispecific 20 Exp Med., 176: 1191-1195 (1992) and Shopes, J. Immunol, 

antibody fragments directly from recombinant cell culture 148: 2918-2922 (1992). Homodimeric antibodies with 

have also been described. For example, bispecific antibodies enhanced an ti -tumor activity may also be prepared using 

have been produced using leucine zippers. Kostelny et al.,7. heterobifunctional cross-linkers as described in Wolff et al. 

Immunol 148(5): 1547-1553 (1992). The leucine zipper Cancer Research, 53: 2560-2565 (1993). Alternatively, an 

peptides from the Fos and Jun proteins were linked to the 25 antibody can be engineered that has dual Fc regions and may 

Fab' portions of two different antibodies by gene fusion. The thereby have enhanced complement lysis and ADCC capa- 

antibody homodimers were reduced at the hinge region to bilities. See Stevenson et al., Anti-Cancer Drug Design, 3: 

form monomers and then re-oxidized to form the antibody 219-230 (1989). 

heterodimers. This method can also be utilized for the 7. Immunoconjugates 

production of antibody homodimers. The "diabody" tech- 30 The invention also pertains to immunoconjugates com- 

nology described by Hollinger et al., Proc. Natl Acad. Sci. prising an antibody conjugated to a cytotoxic agent such as 

USA 90:6444—6448 (1993) has provided an alternative a chemotherapeutic agent, toxin (e.g., an enzymatically 

mechanism for making bispecific antibody fragments. The active toxin of bacterial, fungal, plant, or animal origin, or 

fragments comprise a heavy-chain variable domain (V,,) fragments thereof), or a radioactive isotope (i.e., a 

connected to a light-chain variable domain (V L ) by a linker 35 radiooonjugate). 

which is too short to allow pairing between the two domains Chemotherapeutic agents useful in the generation of such 

on the same chain. Accordingly, the \ H and V x domains of immunoconjugates have been described above. Enzymati- 

one fragment are forced to pair with the complementary V^ cally active toxins and fragments thereof that can be used 

and V H domains of another fragment, thereby forming two include diphtheria A chain, nonbinding active fragments of 

antigen-binding sites. Another strategy for making bispecific 40 diphtheria toxin, exotoxin A chain (from Pseudomonas 

antibody fragments by the use of single-chain Fv (sFv) aeruginosa), ricin A chain, abrin A chain, modeccin A chain, 

dimers has also been reported. See, Gruber et al.,7. Immu- alpha -sarcin, Aleurites fordii proteins, dianthin proteins, 

nol 152:5368(1994). Antibodies with more than two valen- Phytolaca americana proteins (PAPI, PAPII, and PAP-S), 

cies are contemplated. For example, trispecific antibodies momordica charantia inhibitor, curcin, crotin, sapaonaria 

can be prepared. Tutt el al., J. Immunol 147:60 (1991). 45 officinalis inhibitor, gelonin, milogellin, restrictocin, 

Exemplary bispecific antibodies may bind to two different phenomycin, enomycin, and the tricothecenes. A variety of 

epitopes on a given PRO 19598 polypeptide herein. radionuclides are available for the production of radiocon- 

Alternatively, an anti-PR019598 polypeptide arm may be jugated antibodies. Examples include 212 Bi, 131 1, 131 In, 90 Y, 

combined with an arm which binds to a triggering molecule and 186 Re. Conjugates of the antibody and cytotoxic agent 

on a leukocyte such as a T-cell receptor molecule (e.g. CD2, 50 are made using a variety of Afunctional protein -coupling 

CD3, CD28, or B7), or Fc receptors for IgG (FcyR), such as agents such as N-succinimidyl-3-(2-pyridyldithiol) propi- 

FcyRI (CD64), FcyRII (CD32) and FcvRIII (CD 16) so as to onate (SPDP), ininothiolane (IT), bifunctional derivatives of 

focus cellular defense mechanisms to the cell expressing the imidoesters (such as dimethyl adipimidate HCL), active 

particular PR019598 polypeptide. Bispecific antibodies esters (such as disuccinimidyl suberate), aldehydes (such as 

may also be used to localize cytotoxic agents to cells which 55 glutareldehyde), bis-azido compounds (such as bis 

express a particular PRO 19598 polypeptide. These antibod- (p-azidobenzoyl)hexanedi amine), bis-diazonium derivatives 

ies possess a PR019598-binding arm and an arm which (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diiso- 

binds a cytotoxic agent or a radionuclide chelator, such as cyanates (such as tolyene 2,6-diisocyanate), and bis-active 

EOTUBE, DPTA, DOTA, or TETA. Another bispecific fluorine compounds (such as l,5-difluoro-2,4- 

antibody of interest binds the PR019598 polypeptide and 60 dinitrobenzene). For example, a ricin inmnunotoxin can be 

further binds tissue factor (TF). prepared as described in Vitetta et al., Science, 238: 1098 

5. Heteroconjugate Antibodies (1987). Carbon-14-labeled l-isothiocyanatobenzyl-3- 

Heteroconjugate antibodies are also within the scope of methyldiethylene triaminepentaacetic acid (MX-DTPA) is 

the present invention. Heteroconjugate antibodies are com- an exemplary chelating agent for conjugation of radionucle- 

posed of two covalently joined antibodies. Such antibodies 65 otide to the antibody. See WO94/11026. 

have, for example, been proposed to target immune system In another embodiment, the antibody may be conjugated 

cells to unwanted cells [U.S. Pat. No. 4,676,980], and for to a "receptor" (such streptavidin) for utilization in tumor 
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pretargeting wherein the antibody-receptor conjugate is meable matrices of solid hydrophobic polymers containing 

administered to the patient, followed by removal of unbound the antibody, which matrices are in the form of shaped 

conjugate from the circulation using a clearing agent and articles, e.g., films, or microcapsules. Examples of 

then administration of a "ligand" (e.g., avidin) that is con- sustained-release matrices include polyesters, hydrogels (for 

^^^c^^^ & nX^g.^^dk>uacleom- 5 example, poly(2-hydroxyetbyl-methacrylate), or poly 

* dtelosed herein may also be formulated as (^alcohol)), polylactides (U.S Pat. No. 3,773,919), 

immunoliposomes. Liposomes containing the antibody are copolymers of L-glutamic acid and y ethyl-L-glutamate, 

prepared by methods known in the art, such as described in non-degradable ethylene-vinyl acetate, degradable lactic 

Epstein et al., Proc. Natl Acad ScL USA, 82: 3688(1985); acid-glycolic acid copolymers such as the LUPRON 

Hwang et al., Proc. Natl Acad. ScL USA, 77: 4030 (1980); DEPOT™ (injectable microspheres composed of lactic acid- 

and U.S. Pat. Nos. 4,485,045 and 4,544,545. Liposomes glycolic acid copolymer and leuprolide acetate), and poly- 

with enhanced circulation time are disclosed in U.S. Pat. No. D-(-)-3-hydroxybutyric acid. While polymers such as 

5,013,556. ethylene-vinyl acetate and lactic acid-glycolic acid enable 

Particularly useful liposomes can be generated by the release of molecules for over 100 days? cerlain hydrogels 

reverse-phase evaporation method with a hpid composiUon « ^ for ^ x ^ Ms ^ M 

comprising phosphatidylcholine, cholesterol, and PEG- .,_ . . f . . e , . . 

derivatized phosphatidylethanolamine (PEG-PE). Lipo- antibodies remain m the body for a long time, they may 

somes are extruded through filters of defined pore size to denature or aggregate as a result of exposure to moisture at 

yield liposomes with the desired diameter. Fab' fragments of 37 ° C., resulting in a loss of biological activity and possible 

the antibody of the present invention can be conjugated to 20 changes in immunogenicity. Rational strategies can be 

the liposomes as described in Martin et al., J. Biol Chem., devised for stabilization depending on the mechanism 

257: 286-288 (1982) via a disulfide -interchange reaction. A involved. For example, if the aggregation mechanism is 

chemotherapeutic agent (such as Doxorubicin) is optionally discovered to be intermolecular S— S bond formation 

contained within the liposome. See Gabizon et al., J. through thio-disulfide interchange, stabilization may be 

National Cancer Inst., 81(19): 1484 (1989). 25 achieved by modifying sulfhydryl residues, lyophilizing 

9. Pharmaceutical Compositions of Antibodies from acidic solutions, controlling moisture content, using 

Antibodies specifically binding a PR019598 polypeptide appropriate additives, and developing specific polymer 

identified herein, as well as other molecules identified by the matrix compositions, 

screening assays disclosed hereinbefore, can be adminis- ^ TT _ . nnA1rtfAO A ,. 

tered for the treatment of various disorders in the form of 30 Uses for *nti-PROl9598 Antibodies 

pharmaceutical compositions. The anti-PROl9598 antibodies of the invention have 

If the PR019598 polypeptide is intracellular and whole various utilities. For example, anti-PROl9598 antibodies 
antibodies are used as inhibitors, internalizing antibodies are mav be used in diagnostic assays for PR019598, e.g., 
preferred. However, lipofections or liposomes can also be detecting its expression (and in some cases, differential 
used to deliver the antibody, or an antibody fragment, into 35 expression) in specific cells, tissues, or serum. Various 
cells. Where antibody fragments are used, the smallest diagnostic assay techniques known in the art may be used, 
inhibitory fragment that specifically binds to the binding sucn as competitive binding assays, direct or indirect sand- 
domain of the target protein is preferred. For example, based ™ich assays and immunoprecipitation assays conducted in 
upon the variable-region sequences of an antibody, peptide either heterogeneous or homogeneous phases [Zola, Mono- 
molecules can be designed that retain the ability to bind the 40 clonal Antibodies: A Manual of Techniques, CRC Press, Inc. 
target protein sequence. Such peptides can be synthesized (1987) pp. 1 47-158]. The antibodies used in the diagnostic 
chemically and/or produced by recombinant DNA technol- assays can be labeled with a detectable moiety. The detect- 
ogy. See, e.g., Marasco et al., Proc. Natl Acad ScL USA, 90: able moiety should be capable of producing, either directly 
7889-7893 (1993). The formulation herein may also contain or indirectly, a detectable signal. For example, the detectable 
more than one active compound as necessary for the par- 45 moiety may be a radioisotope, such as 3 H, 14 C, 32 p, 35 S, or 
ticular indication being treated, preferably those with l25l > a fluorescent or chemiluminescent compound, such as 
complementary activities that do not adversely affect each fluorescein isothiocyanate, rhodamine, or luciferin, or an 
other. Alternatively, or in addition, the composition may enzyme, such as alkaline phosphatase, beta-galactosidase or 
comprise an agent that enhances its function, such as, for horseradish peroxidase. Any method known in the art for 
example, a cytotoxic agent, cytokine, chemotherapeutic 50 conjugating the antibody to the detectable moiety may be 
agent, or growth-inhibitory agent. Such molecules are suit- employed, including those methods described by Hunter et 
ably present in combination in amounts that are effective for al., Nature, 144:945 (1962); David et al., Biochemistry, 
the purpose intended. 13:1014 (1974); Pain et al., J. Immunol Meth., 40:219 

The active ingredients may also be entrapped in micro- (1981); and Nygren, J. Histochem. and Cytochem., 30:407 

capsules prepared, for example, by coacervation techniques 55 (1982). 

or by interfacial polymerization, for example, hydroxym- Anti-PR019598 antibodies also are useful for the affinity 

ethylcellulose or gelatin-microcapsules and poly- purification of PR019598 from recombinant cell culture or 

(methylmethacylate) microcapsules, respectively, in colloi- natural sources. In this process, the antibodies against 

dal drug delivery systems (for example, liposomes, albumin PR019598 are immobilized on a suitable support, such a 

microspheres, microemulsions, nano-particles, and 60 Sephadex resin or filter paper, using methods well known in 

nanocapsules) or in macroemulsions. Such techniques are the art. The immobilized antibody then is contacted with a 

disclosed in Remington's Pharmaceutical Sciences, supra. sample containing the PR019598 to be purified, and there- 

The formulations to be used for in vivo administration after the support is washed with a suitable solvent that will 

must be sterile. This is readily accomplished by filtration remove substantially all the material in the sample except 

through sterile filtration membranes. 65 the PR019598, which is bound to the immobilized antibody. 

Sustained-release preparations may be prepared. Suitable Finally, the support is washed with another suitable solvent 

examples of sustained -release preparations include semiper- that will release the PR019598 from the antibody. 
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The following examples are offered for illustrative pur- 
poses only, and are not intended to limit the scope of the 
present invention in any way. 

All patent and literature references cited in the present 
specification are hereby incorporated by reference in their 
entirety. 

EXAMPLES 

Commercially available reagents referred to in the 
examples were used according to manufacturer's instruc- 
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reverse PCR primer 



(SEQ ID N0:4) 



><H3 ecd.snrl 5 * TAAGTGCCCTCCCAGGCTGCC 3* 

Additionally, a synthetic oligonucleotide hybridization 
probe was constructed from the consensus DNA132879 
sequence which had the following nucleotide sequence 
hybridization probe 



5 ' TCCTCCAGTCATGAATATAACCCAAGTCAATGGCTCTTTGTTGGTAATTCTC 3' (SEQ ID N0:5) 



tions unless otherwise indicated. The source of those cells 
identified in the following examples, and throughout the 
specification, by ATCC accession numbers is the American 
Type Culture Collection, Manassas, Va. 

EXAMPLE 1 20 
Isolation of cDNA Clones Encoding a Human PR019598 

The extracellular domain (ECD) sequences (including the 
secretion signal sequence, if any) from about 950 known 
secreted proteins from the Swiss-Prot public database were 25 
used to search sequence databases. The databases included 
public databases (e.g., GenBank) In this instance, genomic 
DNA sequence from GenBank was analyzed using the gene 
preditiction program GENSCAN, licenced from Stanford 
University. GENSCAN analysis predicts gene coding 30 
regions, creating sequences which can be subjected to the 
ECD search. The search was performed using the computer 
program BLAST or BLAST2 [Altschul et al, Methods in 
Enzymology, 266:460-480 (1996)] as a comparison of the 
ECD protein sequences to a 6 frame translation of the 35 
sequences. Those comparisons resulting in a BLAST score 
of 70 (or in some cases, 90) or greater that did not encode 
known proteins were clustered and assembled into consen- 
sus DNA sequences with the program "phrap" (Phil Green, 
University of Washington, Seattle, Wash.). 40 

A consensus DNA sequence was assembled. This con- 
sensus sequence is herein designated DNA132879. 

Based on the DNA132879 consensus sequence, oligo- 
nucleotides were synthesized: 1) to identify by PCR a cDNA 45 
library that contained the sequence of interest, and 2) for use 
as probes to isolate a clone of the full-length coding 
sequence for PRO 195 98. Forward and reverse PCR primers 
generally range from 20 to 30 nucleotides and are often 
designed to give a PCR product of about 100-1000 bp in 50 
length. The probe sequences are typically 40-55 bp in 
length. In some cases, additional oligonucleotides are syn- 
thesized when the consensus sequence is greater than about 
1-1.5 kbp. In order to screen several libraries for a full- 
length clone, DNA from the libraries was screened by PCR 55 
amplification, as per Ausubel et al., Current Protocols in 
Molecular Biology, supra, with the PCR primer pair. A 
positive library was then used to isolate clones encoding the 
gene of interest using the probe oligonucleotide and one of 
the primer pairs. 60 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 

(SEQ ID N0:3) 65 
><H3 ecd.snf 1 5 ' CTGGCAACAGCAGTGTCTATTTTGTGC 3* 



A pool of 50 different human cDNA libraries from various 
tissues was used in cloning. The cDNA libraries used to 
isolate the cDNA clones were constructed by standard 
methods using commercially available reagents such as 
those from Invitrogen, San Diego, Calif. The cDNA was 
primed with oligo dT containing a NotI site, linked with 
blunt to Sail hemikinased adaptors, cleaved with NotI, sized 
appropriately by gel electrophoresis, and cloned in a defined 
orientation into a suitable cloning vector (such as pRKB or 
pRKD; pRK5B is a precursor of pRK5D that does not 
contain the Sfil site; see, Holmes et al., Science, 
253:1278-1280 (1991)) in the unique Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above 
gave the full-length DNA sequence for a full-length 
PR019598 polypeptide (designated herein as DNA145887- 
2849 [FIG. 1, SEQ ID NO: 1) and the derived protein 
sequence for that PRO 195 98 polypeptide. 

The full length clone identified above contained a single 
open reading frame with an apparent translational initiation 
site at nucleotide positions 241-243 and a stop signal at 
nucleotide positions 1027-1030 (FIG. 1, SEQ ID NO:l). 
The predicted polypeptide precursor is 262 amino acids 
long, has a calculated molecular weight of approximately 
30419 daltons and an estimated pi of approximately 8.44. 
Analysis of the full-length PRO 195 98 sequence shown in 
FIG. 2 (SEQ ID NO: 2) evidences the presence of a variety 
of import ant polypeptide domains as shown in FIG. 2, 
wherein the locations given for those important polypeptide 
domains are approximate as described above. Clone 
DNA145887-2849 has been deposited with ATCC on Mar. 
21, 2000 and is assigned ATCC deposit no. PTA-1532. 

An analysis of the Day hoff database (version 35.45 Swis- 
sProt 35), using the ALIGN- 2 sequence alignment analysis 
of the full-length sequence shown in FIG. 2 (SEQ ID NO:2), 
evidenced sequence identity between the PRO 19598 amino 
acid sequence and the following Dayhoff 
sequences:AF184971_l . 

EXAMPLE 2 
Use of PR019598 as a Hybridization Probe 

The following method describes use of a nucleotide 
sequence encoding PRO 19598 as a hybridization probe. 

DNA comprising the coding sequence of full-length or 
mature PRO 19598 as disclosed herein is employed as a 
probe to screen for homologous DNAs (such as those 
encoding naturally-occurring variants of PR019598) in 
human tissue cDNA libraries or human tissue genomic 
libraries. 

Hybridization and washing of filters containing either 
library DNAs is performed under the following high strin- 
gency conditions. Hybridization of radiolabeled PRO 19598- 
derived probe to the filters is performed in a solution of 50% 
formamide, 5xSSC, 0.1% SDS, 0.1% sodium 
pyrophosphate, 50 mM sodium phosphate, pH 6.8, 
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2xDenhardt's solution, and 10% dextran sulfate at 42° C. for £. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) 

20 hours. Washing of the filters is performed in an aqueous is resuspended in 10 volumes (w/v) in 7 M guanidine, 20 

solution of O.lxSSC and 0.1% SDS at 42° C. mM Tris, pH 8 buffer. Solid sodium sulfite and sodium 

DNAs having a desired sequence identity with the DNA tetrathionate is added to make final concentrations of 0.1 M 
encoding full-length native sequence PRO 195 98 can then be 5 and 0.02 M, respectively, and the solution is stirred over- 
identified using standard techniques known in the art. night at 4° C. This step results in a denatured protein with 
EXAMPLE 3 a ^ c y steme res id ues blocked by sulfitolization. The solution 
Expression of PR019598 in E. coli f 40 ' 000 , » " Beckman UltracenUfuge 

This example illustrates preparation of an unglycosylated tot ™ supernatant ts diluted with 3- volumes of 

form of PR019598 by recombinant expression m E coli. 10 ?'f ^'"T, guanidine, 20 mM Tns, 

The DNA sequence encoding PR019598 is initially Pi* 7 f> . a ° d fil * ered though 0.22 micron filters to clarify. 

a lifi . „•„ 2>\~~t*A ur-D „JL** n, a The clarified extract is loaded onto a 5 ml Qiagen Ni-NTA 

ampwiea using selected rCR pnmers. Ine primers should , , , . fM , . . , , , 

\ • ^ jJ?;«« olt L n ,u;~u t~ tu metal chelate column equilibrated in the metal chelate 

contain restriction enzyme sites which correspond to the . . « „ . ^ . , . 

restriction enzyme sites on the selected expression vector. A column buffer. The column * washed with additional buffer 

variety of expression vectors may be employed. An example is f°?™ 8 . ■ !, .(^ bl ° chem ' Utrol P" 

of a suitable vector is pBR322 (derived from E. coli; see 1A A Jhc 1 P ro J e,D ■ elu,ed bu * er «>ntauiing 250 mM 

Bolivar et al., Gene, 2:95 (1977)) which contains genes for FracUons containing the desired protein are 

ampicillin and tetracycline resistance. The vector is digested P 00 ed and s « ored a l n C Protem concentration is estimated 

with restriction enzyme and dephosphorylated. The PCR «5 abs °rt>ance at 280 nm using the calculated extincUon 

amplified sequences are then ligated into the vector. The 20 client based 00 >«? J"™? a « d sequence, 

vector will preferably include sequences which encode for . ^ P*? tems 316 f fo j ded jilting *e sam P!% s * ow |y 

an antibiotic resistance gene, a trp promoter, a polyhis leader mto ^J^Z^S^ xf of: 20 m ™ 

(including the first six STII codons, polyhis ^quence, and T ™> P« 86 > 03 ™ Na <?> 25 M urea, 5 mM cysteme, 20 

. . , \ nnriinrno 7- . mM glycine and 1 mM EDTA. Refolding volumes are 

enterokmase cleavage site), the PRO 195 98 coding region, , , , ^ , • • 

lambda transcriptional terminator, and an argU gene. 25 chos ^ 50 that the ^ P r 2} em concentration is between 50 

The ligation mixture is then used to transform a selected t0 \9° "IfJP^^J** refi ^ g f?" U ° n 15 ^ 

E. coli strain using the methods described in Sambrook et ^lly »t 4 C u f °\} 2 7 36 ^ ^ «foldmg reacuon is 

al., supra. Transformants are identified by their ability to ^ u ? addiUon of TFA to a final concentra ion of 

grow on LB plates and antibiotic resistant colonies are then ^4% (pH of approximately 3) Before .further purification of 

selected. Plasmid DNA can be isolated and confirmed by 30 £ e P ro ' em » ^ sohition is filte^ through • 022 miCrOD 

• ■ nnA r . KTA • n filter and acetonitnle is added to 2-10% final concentration. 

restriction analysis and DNA sequencing. , . . 

o , * . , . * u* • i* j i*. The refolded protem is chromatographed on a Poros Rl/H 

Selected clones can be grown overnight in liquid culture JL , . • , , <r r ^ 

u t n u «u i * a -.u reversed phase column usmg a mobile buffer of 0.1% TFA 

medium such as LB broth supplemented with antibiotics. . t . Y . t B c . r 

t, • u* i*. i_ « . , 4 . . 4 with elution with a gradient of acetonitnle from 10 to 80%. 

The overnight culture may subsequently be used to inoculate A1 . e > . & . , A ^™ , 

a larger scale culture. The cells L then grown to a desired 35 ^ uo * of fracll0ns W1 ! h ^9 absorbance are analyzed on 

opted density, during which the expression promoter is SDS P°^cryhmide gels and fractions containing homoge- 

turned on neous refolded protein are pooled. Generally, the properly 

a ru ' u • 4 . r a u *u 11 refolded species of most proteins are eluted at the lowest 

After cultunng the cells for several more hours, the cells Y . r „ 

can be harvested by centrifugation. The cell pellet obtained "'rations of a «!°°^ smce those spec.es are the 

by the centrifugation can be solubilized using various agents 40 most compact with their hydrophobic interiors shielded from 

known in the art, and the solubilized PR019598 protein can "teracUon ^ th ' he reversed phase resin Aggregated spe- 

then be purified using a metal chelating column under "es are usually eluted at higher acetonitnle concentrations. 

conditions that allow tight binding of the protein. a . dd l tlon t0 f*™* m f °! ded forms ? f P ro,eins from J the 

nnni ncno , j ♦ r »• - i u- desu-ed form, the reversed phase step also removes endot- 

PRO 19598 may be expressed in E. coli in a poly-His . f , ' . F F 

tagged form, using the following procedure. The DNA 45 oxm trom the samples. DDrMQCOO 

DDni0 coo v ■ ii l„rfi 0 j „™„ Fractions containing the desired folded PR019598 

encoding PR019598 is initially amplified usmg selected , tJ J , 

or-D -pi ^ M< . t „-^»:JL polypeptide are pooled and the acetonitnle removed using a 

PCR primers. Ine pnmers will contain restnction enzyme r jr r / . .... , A . „ . 

0,^0 ^^c-™^ *^ f . ^ _ # • # - ^ ^ ^ „ rt ' . . gentle stream of nitrogen directed at the solution. Proteins 

sites which conespond to the restnction enzyme sites on the to r , . . _ , _ _ , T ^ ^ . , ^ . . _ _ 

selected expression vector, and other useful sequences pro- are formulated into 20 mM Hepes, pH 6 .8 with 0 14 M 

viding for efficient and refiable translation initiation, rapid 50 ^ ium ^hlonde and 4% mannnol by dialysis or by gel 

purification on a metal chelation column, and proteolytic ™ntou usmg G25 Superfine (Pharmacia) resins equib- 

removal with enterokinase. The PCR-amplified, poly-His bra ^ " ^ fo ™ UlaU °° ^^f?**. 

tagged sequences are then ligated into an expression vector, A ™019598 polypeptides disclosed herein were success- 

which is used to transform an E. coli host based on strain 52 ^ jessed ** described above. 

(W3110 fuhA(tonA) Ion galE rpoHts(htpRts) clpP(laclq). 55 EXAMPLE 4 

Transformants are first grown in LB containing 50 mg/ml Expression of PR019598 in Mammalian Cells 

carbenicillin at 30° C. with shaking until an O.D.600 of 3-5 This example illustrates preparation of a potentially gly- 

is reached. Cultures are then diluted 50-100 fold into CRAP cosylated form of PRO 19598 by recombinant expression in 

media (prepared by mixing 3.57 g (NH 4 ) 2 S0 4 , 0.71 g mammalian cells. 

sodium citrate.2H20, 1.07 gKCi, 5.36 gDifco yeast extract, 60 The vector, pRK5 (see EP 307,247, published Mar. 15, 

5.36 g Sheffield hycase SF in 500 mL water, as well as 110 1989), is employed as the expression vector. Optionally, the 

mM MPOS, pH 7.3, 0.55% (w/v) glucose and 7 mM PR019598 DNA is ligated into pRK5 with selected restric- 

MgS0 4 ) and grown for approximately 20-30 hours at 30° C. tion enzymes to allow insertion of the PR019598 DNA 

with shaking. Samples are removed to verify expression by using ligation methods such as described in Sambrook et al., 

SDS-PAGE analysis, and the bulk culture is centrifiiged to 65 supra. The resulting vector is called pRK5 -PRO 19598. 

pellet the cells. Cell pellets are frozen until purification and In one embodiment, the selected host cells may be 293 

refolding. cells. Human 293 cells (ATCC CCL 1573) are grown to 
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confluence in tissue culture plates in medium such as 
DMEM supplemented with fetal calf serum and optionally, 
nutrient components and/or antibiotics. About 10 1% pRK5- 
PR019598 DNA is mixed with about 1 /*g DNA encoding 
the VARNA gene [Thimmappaya et al., Cell, 31:543 (1982)] 
and dissolved in 500 {d of 1 mM Tris-HCl, 0.1 mM EDTA, 
0.227 M CaCl 2 . To this mixture is added, dropwise, 500 fi\ 
of 50 mM HEPES (pH 7.35), 280 mM NaCl, 1.5 mM 
NaP0 4 , and a precipitate is allowed to form for 10 minutes 
at 25° C. The precipitate is suspended and added to the 293 
cells and allowed to settle for about four hours at 37° C. The 
culture medium is aspirated off and 2 ml of 20% glycerol in 
PBS is added for 30 seconds. The 293 cells are then washed 
with serum free medium, fresh medium is added and the 
cells are incubated for about 5 days. 

Approximately 24 hours after the transfections, the cul- 
ture medium is removed and replaced with culture medium 
(alone) or culture medium containing 200 /*Ci/ml 35 S- 
cysteine and 200 /iCi/ml 35 S-methionine. After a 12 hour 
incubation, the conditioned medium is collected, concen- 
trated on a spin filter, and loaded onto a 15% SDS gel. The 
processed gel may be dried and exposed to film for a 
selected period of time to reveal the presence of PRO 195 98 
polypeptide. The cultures containing transfected cells may 
undergo further incubation (in serum free medium) and the 
medium is tested in selected bioassays. 

In an alternative technique, PRO 19598 may be introduced 
into 293 cells transiently using the dextran sulfate method 
described by Somparyrac et al, Proc. Natl. Acad. ScL, 
12:7575 (1981). 293 cells are grown to maximal density in 
a spinner flask and 700 pig pRK5-PR019598 DNA is added. 
The cells are first concentrated from the spinner flask by 
centrifugation and washed with PBS. The DNA-dextran 
precipitate is incubated on the cell pellet for four hours. The 
cells are treated with 20% glycerol for 90 seconds, washed 
with tissue culture medium, and re-introduced into the 
spinner flask containing tissue culture medium, 5 ^g/ml 
bovine insulin and 0.1 //g/ml bovine transferrin. After about 
four days, the conditioned media is centrifuged and filtered 
to remove cells and debris. The sample containing expressed 
PR019598 can then be concentrated and purified by any 
selected method, such as dialysis and/or column chroma- 
tography. 

In another embodiment, PR019598 can be expressed in 
CHO cells. The pRK5-PR019598 can be transfected into 
CHO cells using known reagents such as CaP0 4 or DEAE- 
dextran. As described above, the cell cultures can be 
incubated, and the medium replaced with culture medium 
(alone) or medium containing a radiolabel such as 35 S- 
methionine. After determining the presence of PRO 19598 
polypeptide, the culture medium may be replaced with 
serum free medium. Preferably, the cultures are incubated 
for about 6 days, and then the conditioned medium is 
harvested. The medium containing the expressed PRO 195 98 
can then be concentrated and purified by any selected 
method. 

Epitope-tagged PR019598 may also be expressed in host 
CHO cells. The PR019598 may be subcloned out of the 
pRK5 vector. Hie subclone insert can undergo PCR to fuse 
in frame with a selected epitope tag such as a poly-his tag 
into a Baculovirus expression vector. The poly-his tagged 
PR019598 insert can then be subcloned into a SV40 driven 
vector containing a selection marker such as DHFR for 
selection of stable clones. Finally, the CHO cells can be 
transfected (as described above) with the SV40 driven 
vector. Labeling may be performed, as described above, to 
verify expression. The culture medium containing the 



expressed poly-His tagged PRO 19598 can then be concen- 
trated and purified by any selected method, such as by 
Ni 2+ -chelate affinity chromatography. 
PR019598 may also be expressed in CHO and/or COS 

5 cells by a transient expression procedure or in CHO cells by 
another stable expression procedure. 

Stable expression in CHO cells is performed using the 
following procedure. The proteins are expressed as an IgG 
construct (immunoadhesin), in which the coding sequences 

io for the soluble forms (e.g. extracellular domains) of the 
respective proteins are fused to an IgGl constant region 
sequence containing the hinge, CH2 and CH2 domains 
and/or is a poly-His tagged form. 

Following PCR amplification, the respective DNAs are 

15 subcloned in a CHO expression vector using standard tech- 
niques as described in Ausubel et al., Current Protocols of 
Molecular Biology, Unit 3.16, John Wiley and Sons (1997). 
CHO expression vectors are constructed to have compatible 
restriction sites 5 1 and 3' of the DNA of interest to allow the 

20 convenient shuttling of cDNA's. The vector used expression 
in CHO cells is as described in Lucas et al., Nucl Acids Res. 
24:9 (1774-1779 (1996), and uses the SV40 early promoter/ 
enhancer to drive expression of the cDNA of interest and 
dihydrofolate reductase (DHFR). DHFR expression permits 

25 selection for stable maintenance of the plasmid following 
transfection. 

Twelve micrograms of the desired plasmid DNA is intro- 
duced into approximately 10 million CHO cells using com- 
mercially available transfection reagents Superfect® 

30 (Qiagen), Dosper® or Fugene® (Boehringer Mannheim). 
The cells are grown as described in Lucas et al., supra. 
Approximately 3xl0 7 cells are frozen in an ampule for 
further growth and production as described below. 
The ampules containing the plasmid DNA are thawed by 

35 placement into water bath and mixed by vortexing. The 
contents are pipetted into a centrifuge tube containing 10 
mLs of media and centrifuged at 1000 rpm for 5 minutes. 
The supernatant is aspirated and the cells are resuspended in 
10 mL of selective media (0.2 //m filtered PS20 with 5% 0.2 

40 fim diafiltered fetal bovine serum). The cells are then ali- 
quoted into a 100 mL spinner containing 90 mL of selective 
media. After 1-2 days, the cells are transferred into a 250 mL 
spinner filled with 150 mL selective growth medium and 
incubated at 37° C. After another 2-3 days, 250 mL, 500 mL 

45 and 2000 mL spinners are seeded with 3x10 s cells/mL. The 
cell media is exchanged with fresh media by centrifugation 
and resuspension in production medium. Although any 
suitable CHO media may be employed, a production 
medium described in U.S. Pat. No. 5,122,469, issued Jun. 

50 16, 1992 may actually be used. A 3L production spinner is 
seeded at 1.2xl0 6 cells/mL. On day 0, the cell number pH 
ie determined. On day 1, the spinner is sampled and sparging 
with filtered air is commenced. On day 2, the spinner is 
sampled, the temperature shifted to 33° C, and 30 mLof 500 

55 g/L glucose and 0.6 mL of 10% antifoam (e.g., 35% poly- 
dimethylsiloxane em Dow Corning 365 Medical Grade 
Emulsion) taken. Throughout the production, the pH is 
adjusted as necessary to keep it at around 7.2. After 10 days, 
or until the viability dropped below 70%, the cell culture is 

60 harvested by centrifugation and filtering through a 0.22 fim 
filter. The filtrate was either stored at 4° C. or immediately 
loaded onto columns for purification. 

For the poly-His tagged constructs, the proteins are puri- 
fied using a Ni-NTA column (Qiagen). Before purification, 

65 imidazole is added to the conditioned media to a concen- 
tration of 5 mM. The conditioned media is pumped onto a 6 
ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, 
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buffer containing 0.3 M NaCl and 5 mM imidazole at a flow 
rate of 4-5 ml/min. at 4° C. After loading, the column is 
washed with additional equilibration buffer and the protein 
eluted with equilibration buffer containing 0.25 M imida- 
zole. The highly purified protein is subsequently desalted 
into a storage buffer containing 10 mM Hepes, 0.14 M NaQ 
and 4% mannitol, pH 6.8, with a 25 ml G25 Superfine 
(Pharmacia) column and stored at -80° C. 

Immunoadhesin (Fc-containing) constructs are purified 
from the conditioned media as follows. The conditioned 
medium is pumped onto a 5 ml Protein A column 
(Pharmacia) which had been equilibrated in 20 mM Na 
phosphate buffer, pH 6.8. After loading, the column is 
washed extensively with equilibration buffer before elution 
with 100 mM citric acid, pH 3.5. The eluted protein is 
immediately neutralized by collecting 1 ml fractions into 
tubes containing 275 fiL of 1 M Tris buffer, pH 9. The highly 
purified protein is subsequently desalted into storage buffer 
as described above for the poly-His tagged proteins. The 
homogeneity is assessed by SDS polyacrylamide gels and by 
N-terminal amino acid sequencing by Ed man degradation. 

EXAMPLE 5 
Expression of PR019598 in Yeast 

The following method describes recombinant expression 
of PR019598 in yeast. 

First, yeast expression vectors are constructed for intra- 
cellular production or secretion of PRO 19598 from the 
ADH2/GAPDH promoter. DNA encoding PR019598 and 
the promoter is inserted into suitable restriction enzyme sites 
in the selected plasmid to direct intracellular expression of 
PR019598. For secretion, DNA encoding PR019598 can be 
cloned into the selected plasmid, together with DNA encod- 
ing the ADH2/GAPDH promoter, a native PR019598 signal 
peptide or other mammalian signal peptide, or, for example, 
a yeast alpha-factor or invertase secretory signal/leader 
sequence, and linker sequences (if needed) for expression of 
PR019598. 

Yeast cells, such as yeast strain AB110, can then be 
transformed with the expression plasmids described above 
and cultured in selected fermentation media. The trans- 
formed yeast supernatants can be analyzed by precipitation 
with 10% trichloroacetic acid and separation by SDS-PAGE, 
followed by staining of the gels with Coomassie Blue stain. 

Recombinant PRO 195 98 can subsequently be isolated 
and purified by removing the yeast cells from the fermen- 
tation medium by centrifugation and then concentrating the 
medium using selected cartridge filters. The concentrate 
containing PR019598 may further be purified using selected 
column chromatography resins. 

EXAMPLE 6 

Expression of PRO 19598 in Baculo virus-infected Insect 
Cells 

The following method describes recombinant expression 
of PRO 19598 in Baculovirus-infected insect cells. 

The sequence coding for PRO 195 98 is fused upstream of 
an epitope tag contained within a baculovirus expression 
vector. Such epitope tags include poly-his tags and immu- 
noglobulin tags (like Fc regions of IgG). A variety of 
plasmids may be employed, including plasmids derived 
from commercially available plasmids such as pVL1393 
(Novagen). Briefly, the sequence encoding PRO 19598 or the 
desired portion of the coding sequence of PR019598 such as 
the sequence encoding the extracellular domain of a trans- 
membrane protein or the sequence encoding the mature 
protein if the protein is extracellular is amplified by PCR 
with primers complementary to the 5' and 3' regions. The 5' 
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primer may incorporate flanking (selected) restriction 
enzyme sites. The product is then digested with those 
selected restriction enzymes and subcloned into the expres- 
sion vector. 

5 Recombinant baculovirus is generated by co-transfecting 
the above plasmid and BaculoGold™ virus DNA 
(Pharmingen) into Spodoptera frugiperda ("Sf9") cells 
(ATCC CRL 1711) using lipofectin (commercially available 
from GIBCO-BRL). After 4-5 days of incubation at 28° C, 
the released viruses are harvested and used for further 
amplifications. Viral infection and protein expression are 
performed as described by O'Reilley et al., Baculovirus 
expression vectors: A Laboratory Manual, Oxford: Oxford 
University Press (1994). 
l5 Expressed poly-his tagged PR019598 can then be 
purified, for example, by Ni^-chelate affinity chromatogra- 
phy as follows. Extracts are prepared from recombinant 
virus-infected Sf9 cells as described by Rupert et al., Nature, 
362:175-179 (1993). Briefly, Sf9 cells are washed, resus- 
pended in sonication buffer (25 mL Hepes, pH 7.9; 12.5 mM 
MgCl 2 ; 0.1 mM EDTA; 10% glycerol; 0.1% NP-40; 0.4 M 
KC1), and sonicated twice for 20 seconds on ice. The 
sonicates are cleared by centrifugation, and the supernatant 
is diluted 50-fold in loading buffer (50 mM phosphate, 300 
25 mM NaCl, 10% glycerol, pH 7.8) and filtered through a 0.45 
fim filter. A Ni 2+ -NTA agarose column (commercially avail- 
able from Qiagen) is prepared with a bed volume of 5 mL, 
washed with 25 mL of water and equilibrated with 25 mLof 
loading buffer. The filtered cell extract is loaded onto the 
3Q column at 0.5 mL per minute. The column is washed to 
baseline A^ with loading buffer, at which point fraction 
collection is started. Next, the column is washed with a 
secondary wash buffer (50 mM phosphate; 300 mM NaCl, 
10% glycerol, pH 6.0), which elutes nonspecifically bound 
35 protein. After reaching A^qq baseline again, the column is 
developed with a 0 to 500 mM Imidazole gradient in the 
secondary wash buffer. One mL fractions are collected and 
analyzed by SDS-PAGE and silver staining or Western blot 
with Ni 2+ -NTA-conjugated to alkaline phosphatase 
^ (Qiagen). Fractions containing the eluted His 10 -tagged 
PR019598 are pooled and dialyzed against loading buffer. 

Alternatively, purification of the IgG tagged (or Fc 
tagged) PR019598 can be performed using known chroma- 
tography techniques, including for instance, Protein A or 
45 protein G column chromatography. 

PRO 19598 polypeptides disclosed herein were success- 
fully expressed as described above. 

EXAMPLE 7 
Preparation of Antibodies that Bind PR019598 

50 This example illustrates preparation of monoclonal anti- 
bodies which can specifically bind PR019598. 

Techniques for producing the monoclonal antibodies are 
known in the art and are described, for instance, in Goding, 
supra. Immunogens that may be employed include purified 

55 PRO 19598, fusion proteins containing PR019598, and 
cells expressing recombinant PR019598 on the cell surface. 
Selection of the immunogen can be made by the skilled 
artisan without undue experimentation. 

Mice, such as Balb/c, are immunized with the PR019598 

60 immunogen emulsified in complete Freund's adjuvant and 
injected subcutaneously or intra peri toneally in an amount 
from 1-100 micrograms. Alternatively, the immunogen is 
emulsified in MPL-TDM adjuvant (Ribi Immunochemical 
Research, Hamilton, Mont.) and injected into the animal's 

65 hind foot pads. The immunized mice are then boosted 10 to 
12 days later with additional immunogen emulsified in the 
selected adjuvant. Thereafter, for several weeks, the mice 
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may also be boosted with additional immunization injec- 
tions. Serum samples may be periodically obtained from the 
mice by retro-orbital bleeding for testing in EUS A assays to 
detect anti-PR019598 antibodies. 

After a suitable antibody titer has been detected, the 
animals "positive" for antibodies can be injected with a final 
intravenous injection of PR019598. Three to four days later, 
the mice are sacrificed and the spleen cells are harvested. 
The spleen cells are then fused (using 35% polyethylene 
glycol) to a selected murine myeloma cell line such as 
P3X63AgU.l, available from ATCC, No. CRL 1597. The 
fusions generate hybridoma cells which can then be plated 
in 96 well tissue culture plates containing HAT 
(hypoxanthine, aminopterin, and thymidine) medium to 
inhibit proliferation of non-fused cells, myeloma hybrids, 
and spleen cell hybrids. 

The hybridoma cells will be screened in an EliSA for 
reactivity against PRO 19598. Determination of "positive" 
hybridoma cells secreting the desired monoclonal antibodies 
against PR019598 is within the skill in the art. 

The positive hybridoma cells can be injected intraperito- 
neally into syngeneic Balb/c mice to produce ascites con- 
taining the anti-PR019598 monoclonal antibodies. 
Alternatively, the hybridoma cells can be grown in tissue 
culture flasks or roller bottles. Purification of the mono- 
clonal antibodies produced in the ascites can be accom- 
plished using ammonium sulfate precipitation, followed by 
gel exclusion chromatography. Alternatively, affinity chro- 
matography based upon binding of antibody to protein A or 
protein G can be employed. 

EXAMPLE 8 
PR019598 Polypeptides Using Specific 
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Purification 
Antibodies 

Native or recombinant PRO 19598 polypeptides may be 
purified by a variety of standard techniques in the art of 
protein purification. For example, pro-PROl9598 
polypeptide, mature PR019598 polypeptide, or pre- 
PR019598 polypeptide is purified by immunoaffinity chro- 
matography using antibodies specific for the PR019598 
polypeptide of interest. In general, an immunoaffinity col- 
umn is constructed by covalently coupling the anti- 
PRO 19598 polypeptide antibody to an activated chromato- 
graphic resin. 

Polyclonal immunoglobulins are prepared from immune 
sera either by precipitation with ammonium sulfate or by 
purification on immobilized Protein A (Pharmacia LKB 
Biotechnology, Piscataway, NJ.). Likewise, monoclonal 
antibodies are prepared from mouse ascites fluid by ammo- 
nium sulfate precipitation or chromatography on immobi- 
lized Protein A. Partially purified immunoglobulin is 50 
covalently attached to a chromatographic resin such as 
CnBr-activated SEPHAROSE™ (Pharmacia LKB 
Biotechnology). The antibody is coupled to the resin, the 
resin is blocked, and the derivative resin is washed accord- 
ing to the manufacturer's instructions. 

Such an immunoaffinity column is utilized in the purifi- 
cation of PRO 19598 polypeptide by preparing a fraction 
from cells containing PR019598 polypeptide in a soluble 
form. This preparation is derived by solubilization of the 
whole cell or of a subcellular fraction obtained via differ- 
ential centrifiigation by the addition of detergent or by other 
methods well known in the art. Alternatively, soluble 
PR019598 polypeptide containing a signal sequence may be 
secreted in useful quantity into the medium in which the 
cells are grown. 

A soluble PRO 19598 polypeptide-containing preparation 
is passed over the immunoaffinity column, and the column 
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is washed under conditions that allow the preferential absor- 
bance of PRO 195 98 polypeptide (e.g., high ionic strength 
buffers in the presence of detergent). Then, the column is 
eluted under conditions that disrupt antibody/PR019598 
polypeptide binding (e.g., a low pH buffer such as approxi- 
mately pH 2-3, or a high concentration of a chaotrope such 
as urea or thiocyanate ion), and PR019598 polypeptide is 
collected. 

EXAMPLE 9 

Drug Screening 

This invention is particularly useful for screening com- 
pounds by using PRO 19598 polypeptides or binding frag- 
ment thereof in any of a variety of drug screening tech- 
niques. The PR019598 polypeptide or fragment employed 
in such a test may either be free in solution, affixed to a solid 
support, borne on a cell surface, or located intracellularly. 
One method of drug screening utilizes eukaryotic or 
prokaryotic host cells which are stably transformed with 
recombinant nucleic acids expressing the PRO 19598 
polypeptide or fragment. Drugs are screened against such 
transformed cells in competitive binding assays. Such cells, 
either in viable or fixed form, can be used for standard 
binding assays. One may measure, for example, the forma- 
tion of complexes between PR019598 polypeptide or a 
fragment and the agent being tested. Alternatively, one can 
examine the diminution in complex formation between the 
PRO 195 98 polypeptide and its target cell or target receptors 
caused by the agent being tested. 

Thus, the present invention provides methods of screen- 
ing for drugs or any other agents which can affect a 
PR019598 polypep tide-associated disease or disorder. 
These methods comprise contacting such an agent with an 
PR019598 polypeptide or fragment thereof and assaying (1) 
for the presence of a complex between the agent and the 
PR019598 polypeptide or fragment, or (ii) for the presence 
of a complex between the PRO 19598 polypeptide or frag- 
ment and the cell, by methods well known in the art. In such 
competitive binding assays, the PRO 19598 polypeptide or 
fragment is typically labeled. After suitable incubation, free 
PR019598 polypeptide or fragment is separated from that 
present in bound form, and the amount of free or uncom- 
plexed label is a measure of the ability of the particular agent 
to bind to PRO 19598 polypeptide or to interfere with the 
PR019598 polypeptide/cell complex. 

Another technique for drug screening provides high 
throughput screening for compounds having suitable bind- 
ing affinity to a polypeptide and is described in detail in WO 
84/03564, published on Sep. 13, 1984. Briefly stated, large 
numbers of different small peptide test compounds are 
synthesized on a solid substrate, such as plastic pins or some 
other surface. As applied to a PRO 19598 polypeptide, the 
peptide test compounds are reacted with PRO 19598 
polypeptide and washed. Bound PRO 195 98 polypeptide is 
detected by methods well known in the art. Purified 
PR019598 polypeptide can also be coated directly onto 
plates for use in the aforementioned drug screening tech- 
niques. In addition, non-neutralizing antibodies can be used 
to capture the peptide and immobilize it on the solid support. 

This invention also contemplates the use of competitive 
drug screening assays in which neutralizing antibodies 
capable of binding PR019598 polypeptide specifically com- 
pete with a test compound for binding to PR019598 
polypeptide or fragments thereof. In this manner, the anti- 
bodies can be used to detect the presence of any peptide 
which shares one or more antigenic determinants with 
PR019598 polypeptide. 
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EXAMPLE 10 

Rational Drug Design 

The goal of rational drug design is to produce structural 
analogs of biologically active polypeptide of interest (i.e., a 
PR019598 polypeptide) or of small molecules with which 
they interact, e.g., agonists, antagonists, or inhibitors. Any of 
these examples can be used to fashion drugs which are more 
active or stable forms of the PR019598 polypeptide or 
which enhance or interfere with the function of the 
PRO 19598 polypeptide in vivo (cf., Hodgson, Biol 
Technology, 9: 19-21(1991)). 

In one approach, the three-dimensional structure of the 
PR019598 polypeptide, or of an PR019598 polypeptide- 
inhibitor complex, is determined by x-ray crystallography, 
by computer modeling or, most typically, by a combination 
of the two approaches. Both the shape and charges of the 
PRO 195 98 polypeptide must be ascertained to elucidate the 
structure and to determine active site(s) of the molecule. 
Less often, useful information regarding the structure of the 
PRO 19598 polypeptide may be gained by modeling based 
on the structure of homologous proteins. In both cases, 
relevant structural information is used to design analogous 
PR019598 polypeptide-like molecules or to identify effi- 
cient inhibitors. Useful examples of rational drug design 
may include molecules which have improved activity or 
stability as shown by Braxton and Wells, Biochemistry, 
31:7796-7801 (1992) or which act as inhibitors, agonists, or 
antagonists of native peptides as shown by Athauda et al.,/. 
Biocfiem., 113:742-746 (1993). 

It is also possible to isolate a target-specific antibody, 
selected by functional assay, as described above, and then to 
solve its crystal structure. This approach, in principle, yields 
a pharmacore upon which subsequent drug design can be 
based. It is possible to bypass protein crystallography alto- 
gether by generating anti-idiotypic antibodies (anti-ids) to a 
functional, pharmacologically active antibody. As a mirror 
image of a mirror image, the binding site of the anti-ids 
would be expected to be an analog of the original receptor. 
The anti-id could then be used to identify and isolate 
peptides from banks of chemically or biologically produced 
peptides. The isolated peptides would then act as the phar- 
macore. 

By virtue of the present invention, sufficient amounts of 
the PRO 19598 polypeptide may be made available to per- 
form such analytical studies as X-ray crystallography. In 
addition, knowledge of the PRO 19598 polypeptide amino 
acid sequence provided herein will provide guidance to 
those employing computer modeling techniques in place of 
or in addition to x-ray crystallography. 

EXAMPLE 11 
Tissue Expression Distribution 

Oligonucleotide probes were constructed from the 
PR019598 polypeptide-encoding nucleotide sequence 
shown in FIG. 1 for use in quantitative PCR amplification 
reactions. The oligonucleotide probes were chosen so as to 
give an approximately 200-600 base pair amplified frag- 
ment from the 3' end of its associated template in a standard 
PCR reaction. The oligonucleotide probes were employed in 
standard quantitative PCR amplification reactions with 
cDNA libraries isolated from different human adult and/or 
fetal tissue sources and analyzed by agarose gel electro- 
phoresis so as to obtain a quantitative determination of the 
level of expression of the PRO 195 98 polypeptide-encoding 
nucleic acid in the various tissues tested. Knowledge of the 
expression pattern or the differential expression of the 
PR019598 polypeptide-encoding nucleic acid in various 
different human tissue types provides a diagnostic marker 
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useful for tissue typing, with or without other tissue-specific 
markers, for determining the primary tissue source of a 
metastatic tumor, disease diagnosis, and the like. These 
assays provided the following results: 



DNA Molecule 

Tissues w/Significant Expression 

10 

DNA145887-2849 Highly expressed in cartilage, placenta and spleen. 

It is expressed in testis, adrenal gland and prostate. 
Tissues w/o Significant Expression 

DNA145887-2849 Is not expressed in HUVEC, colon tumor, heart, 
15 bone marrow, aortic endothelial cells or uterus. 



EXAMPLE 12 
Identification of Receptor/Ligand Interactions 

20 In this assay, various PRO polypeptides are tested for 
ability to bind to a panel of potential receptor molecules for 
the purpose of identifying receptor/ligand interactions. The 
identification of a ligand for a known receptor, a receptor for 
a known ligand or a novel receptor/ligand pair is useful for 

25 a variety of indications including, for example, targeting 
bioactive molecules (linked to the ligand or receptor) to a 
cell known to express the receptor or ligand, use of the 
receptor or ligand as a reagent to detect the presence of the 
ligand or receptor in a composition suspected of containing 

30 the same, wherein the composition may comprise cells 
suspected of expressing the ligand or receptor, modulating 
the growth of or another biological or immunological activ- 
ity of a cell known to express or respond to the receptor or 
ligand, modulating the immune response of cells or toward 

35 cells that express the receptor or ligand, allowing the 
preparaion of agonists, antagonists and/or antibodies 
directed against the receptor or ligand which will modulate 
the growth of or a biological or immunological activity of a 
cell expressing the receptor or ligand, and various other 

40 indications which will be readily apparent to the ordinarily 
skilled artisan. 

The assay is performed as follows. A PRO polypeptide of 
the present invention suspected of being a ligand for a 
receptor is expressed as a fusion protein containing the Fc 

45 domain of human IgG (an Immunoadhesin). Receptor- 
ligand binding is detected by allowing interaction of the 
immunoadhesin polypeptide with cells (e.g. Cos cells) 
expressing candidate PRO polypeptide receptors and visu- 
alization of bound immunoadhesin with fluorescent reagents 

50 directed toward the Fc fusion domain and examination by 
microscope. Cells expressing candidate receptors are pro- 
duced by transient transfection, in parallel, of defined sub- 
sets of a library of cDNA expression vectors encoding PRO 
polypeptides that may function as receptor molecules. Cells 

55 are then incubated for 1 hour in the presence of the PRO 
polypeptide immunoadhesin being tested for possible recep- 
tor binding. The cells are then washed and fixed with 
paraformaldehyde. The cells are then incubated with fluo- 
rescent conjugated antibody directed against the Fc portion 

60 of the PRO polypeptide immunoadhesin (e.g. FITC conju- 
gated goat anti-human-Fc antibody). The cells are then 
washed again and examined by microscope. A positive 
interaction is judged by the presence of fluorescent labeling 
of cells transfected with cDNA encoding a particular PRO 

65 polypeptide receptor or pool of receptors and an absence of 
similar fluorescent labeling of similarly prepared cells that 
have been transfected with other cDNA or pools of cDNA. 
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If a defined pool of cDNA expression vectors is judged to be was obtained and compared to a "universal" epithelial 

positive for interaction with a PRO polypeptide control sample which was prepared by pooling non- 

immunoadhesin, the individual cDNA species that comprise cancerous human tissues of epithelial origin, including liver, 

the pool are tested individually (the pool is "broken down") kidney, and lung. mRNA isolated from the pooled tissues 

to determine the specific cDNA that encodes a receptor able 5 represents a mixture of expressed gene products from these 

to interact with the PRO polypeptide immunoadhesin, different tissues. Microarray hybridization experiments 

In another embodiment of this assay, an epitope-tagged using the pooled control samples generated a linear plot in 

potential ligand PRO polypeptide (e.g. 8 histidine "His" tag) a 2-color analysis. The slope of the line generated in a 

is allowed to interact with a panel of potential receptor PRO 2-color analysis was then used to normalize the ratios of 

polypeptide molecules that have been expressed as fusions JQ (test: control detection) within each experiment. The nor- 

with the Fc domain of human IgG (immunoadhesins). Fol- malized ratios from various experiments were then com- 

lowing a 1 hour co-incubation with the epitope tagged PRO pared and used to identify clustering of gene expression, 

polypeptide, the candidate receptors are each immunopre- Thus, the pooled "universal control" sample not only 

cipitated with protein A beads and the beads are washed. allowed effective relative gene expression determinations in 

Potential ligand interaction is determined by western blot a simple 2-sample comparison, it also allowed multi-sample 

analysis of the immunoprecipitated complexes with anti- comparisons across several experiments, 

body directed towards the epitope tag. An interaction is In the present experiments, nucleic acid probes derived 

judged to occur if a band of the anticipated molecular weight from the herein described PRO polypeptide-encoding 

of the epitope tagged protein is observed in the western blot nucleic acid sequences were used in the creation of the 

analysis with a candidate receptor, but is not observed to microarray and RNA from the tumor tissues listed above 

occur with the other members of the panel of potential 20 were used for the hybridization thereto. A value based upon 

receptors. the normalized ratio rexperimental ratio was designated as a 

(1) PR019598 binds to PRO3301 and vice versa. "cutoff ratio". Only values that were above this cutoff ratio 

were determined to be significant. The results of these 

w . . . _ _ _ nw% ^ experiments are shown below, demonstrating that various 

Microarray Analysis to Detect Overexpression of PRO 25 PR0 polypeptides of the present invention are significantly 

Polypeptides m Cancerous Tumors overexpressed in various human tumor tissues as compared 

Nucleic acid microarrays often containing thousands of tQ a non . cancerous human iissaB control . M describe d 

gene sequences, are useful for identifying differentially abo ^ data demonstrate that the PRO of 

expressed genes in ceased tissues as compared to their ^ , m usefa , nQt Qm JXgnostic 

normal counterparts. Using nucleic acid microarrays, test 30 markers for the presence of one or more cancerous tumors, 

and control mRNA samples from test and control tissue bu , ^ m ag th eutk t ts for , he ^ lmcRt of 

samples are reverse transcribed and labeled to generate mose tumors 
cDNA probes. The cDNA probes are then hybridized to an 
array of nucleic acids immobilized on a solid support. The 

array is configured such that the sequence and position of 35 
each member of the array is known. For example, a selection 
of genes known to be expressed in certain disease states may 
be arrayed on a solid support. Hybridization of a labeled 
probe with a particular array member indicates that the 

sample from which the probe was derived expresses that 40 
gene. If the hybridization signal of a probe from a test 

(disease tissue) sample is greater than hybridization signal of Deposit of Material 

a probe from a control (normal tissue) sample the gene or ^ followin materials have been d ited ^ the 

genes overexpressed in the disease tissue are identified. The Am6m T Culture Collection, 10801 University Blvd., 

implication of this result is that an overexpressed protein in 45 ManassaSj Va 20110-2209, USA (ATCC): 
a diseased tissue is useful not only as a diagnostic marker for 
the presence of the disease condition, but also as a thera- 
peutic target for treatment of the disease condition. 

The methodology of hybridization of nucleic acids and 
microarray technology is well known in the art. In the 50 
present example, the specific preparation of nucleic acids for 
hybridization and probes, slides, and hybridization condi- 
tions are all detailed in U.S. Provisional Patent Application These deposits were made under the provisions of the 
Serial No. 60/193,767, filed on Mar. 31, 2000 and which is Budapest Treaty on the International Recognition of the 
herein incorporated by reference. 55 Deposit of Microorganisms for the Purpose of Patent Pro- 

In the present example, cancerous tumors derived from cedure and the Regulations thereunder (Budapest Treaty), 
various human tissues were studied for PRO polypeptide- This assures maintenance of a viable culture of the deposit 
encoding gene expression relative to non-cancerous human for 30 years from the date of deposit. The deposits will be 
tissue in an attempt to identify those PRO polypeptides made available by ATCC under the terms of the Budapest 
which are overexpressed in cancerous tumors. Two sets of 60 Treaty, and subject to an agreement between Genentech, Inc. 
experimental data were generated. In one set, cancerous and ATCC, which assures permanent and unrestricted avail- 
human colon tumor tissue and matched non-cancerous ability of the progeny of the culture of the deposit to the 
human colon tumor tissue from the same patient ("matched public upon issuance of the pertinent U.S. patent or upon 
colon control") were obtained and analyzed for PRO laying open to the public of any U.S. or foreign patent 
polypeptide expression using the above described microar- 65 application, whichever comes first, and assures availability 
ray technology. In the second set of data, cancerous human of the progeny to one determined by the U.S. Commissioner 
tumor tissue from any of a variety of different human tumors of Patents and Trademarks to be entitled thereto according to 



Molecule 


is overexpressed in: 


as compared to: 


PRO3301 


colon tumor 


universal normal control 


PRO3301 


lung tumor 


universal normal control 


PRO3301 


breast tumor 


universal normal control 


PRO3301 


rectal tumor 


universal normal control 



Material 


ATCC Deposit No. 


Deposit Date 


DNA145887-2849 


PTA-1532 


March 21, 2000 
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35 USC § 122 and the Commissioner's rules pursuant 
thereto (including 37 CFR § 1.14 with particular reference 
to 886 OG 638). 

The assignee of the present application has agreed that if 
a culture of the materials on deposit should die or be lost or 
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destroyed when cultivated under suitable conditions, the 
materials will be promptly replaced on notification with 
another of the same. Availability of the deposited material is 
not to be construed as a license to practice the invention in 
contravention of the rights granted under the authority of 
any government in accordance with its patent laws. 



SEQUENCE LISTING 

<160> NUMBER OF SEQ ID NOS : 7 

<210> SEQ ID NO 1 

<211> LENGTH: 1318 

<212> TYPE: DNA 

<213> ORGANISM: Homo Sapien 

<4 00> SEQUENCE: 1 



cagtttcttc 


atctgtaaca 


tcaaatgaat 


aataatacca 


atctcctaga 


50 


cttcataaga 


ggattaacaa 


agacaaaata 


tgggaaaaac 


ataacatggc 


100 


gtcccataat 


tattagatct 


tattattgac 


actaaaatgg 


cattaaaatt 


150 


accaaaagga 


agacagcatc 


tgtttcctct 


ttggtcctga 


gctggttaaa 


200 


aggaacactg 


gttgcctgaa 


cagtcacact 


tgcaaccatg 


atgectaaac 


250 


attgetttet 


aggcttcctc 


atcagtttct 


tccttactgg 


tgtagcagga 


300 


actcagtcaa 


cgcatgagtc 


tetgaagect 


cagagggtac 


aatttcagtc 


JDO 


ccgaaatttt 


cacaacattt 


igcaai.ygca 


gectgggagg 




400 


gcaacagcag 


tgtctatttt 


gtgeagtaca 


aaatcatgtt 


ctcatgcagc 


450 


atgaaaagct 


ctcaccagaa 


gccaagtgga 


tgctggcagc 


acatttcttg 


500 


taacttccca 


ggctgcagaa 


cattggctaa 


a-tatggacag 


agacaatgga 


550 


aaaataaaga 


ag actgttgg 


ggtactcaag 


aactctcttg 


tgaccttacc 


ouu 


agtgaaacct 


cagacataca 


ggaaccttat 


tacgggaggg 


tgagggegge 


650 


ctcggctggg 


agctactcag 


aatggagcat 


gacgccgcgg 


ttcactccct 


700 


ggtgggaaac 


aaaaatagat 


cctccagtca 


tgaatataac 


ccaagtcaat 


750 


ggctctttgt 


tggtaattct 


ccatgctcca 


aatttaccat 


atagatacca 


800 


aaaggaaaaa 


aatgtatcta 


tagaagatta 


ctatgaacta 


ctataccgag 


850 


tttttataat 


taacaattca 


ctagaaaagg 


agcaaaaggt 


ttatgaaggg 


900 


gctcacagag 


cggttgaaat 


tgaagctcta 


acaccacact 


ccagctactg 


950 


tgtagtggct 


gaaatatatc 


agcccatgtt 


agacagaaga 


agtcagagaa 


1000 


gtgaagagag 


atgtgtggaa 


attccatgac 


ttgtggaatt 


tggcattcag 


1050 


caatgtggaa 


attctaaagc 


tccctgagaa 


caggatgact 


cgtgtttgaa 


1100 


ggatcttatt 


taaaattgtt 


tttgtatttt 


cttaaagcaa 


tattcactgt 


1150 


tacaccttgg 


ggacttcttt 


gtttatccat 


tcttttatcc 


tttatatttc 


1200 


atttgtaaac 


tatatttgaa 


cgacattccc 


cccgaaaaat 


tgaaatgtaa 


1250 


agatgaggca 


gagaataaag 


tgttctatga 


aaaaaaaaaa 


aaaaaaaaaa 


1300 


aaaaaaaaaa 


aaaaaaaa 








1318 



<210> SEQ ID NO 2 
<211> LENGTH: 262 
<212> TYPE: PRT 
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<213> ORGANISM: Homo Sapien 
<400> SEQUENCE: 2 

Het Pro Lys His Cys Fhe Leu Gly Fhe Leu lie Ser Phe Phe Leu 
15 10 15 

Thr Gly Val Ala Gly Thr Gin Ser Thr His Glu Ser Leu Lys Pro 
20 25 30 

Gin Arg Val Gin Phe Gin Ser Arg Asn Phe His Asn lie Leu Gin 
35 40 45 

Trp Gin Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe 
50 55 60 

Val Gin Tyr Lys lie Met Phe Ser Cys Ser Met Lys Ser Ser His 
65 70 75 

Gin Lys Pro Ser Gly Cys Trp Gin His lie Ser Cys Asn Phe Pro 
80 85 90 

Gly Cys Arg Thr Leu Ala Lys Tyr Gly Gin Arg Gin Trp Lys Asn 
95 100 ~ 105 

Lys Glu Asp Cys Trp Gly Thr Gin Glu Leu Ser Cys Asp Leu Thr 
110 115 120 

Ser Glu Thr Ser Asp He Gin Glu Pro Tyr Tyr Gly Arg Val Arg 
125 130 135 

Ala Ala Ser Ala Gly Ser Tyr Ser Glu Trp Ser Met Thr Pro Arg 
140 145 150 

Phe Thr Pro Trp Trp Glu Thr Lys He Asp Pro Pro Val Met Asn 
155 160 165 

He Thr Gin Val Asn Gly Ser Leu Leu Val He Leu His Ala Pro 
170 175 180 

Asn Leu Pro Tyr Arg Tyr Gin Lys Glu Lys Asn Val Ser He Glu 
185 190 195 

Asp Tyr Tyr Glu Leu Leu Tyr Arg Val Phe He He Asn Asn Ser 
200 205 210 

Leu Glu Lys Glu Gin Lys Val Tyr Glu Gly Ala His Arg Ala Val 
215 220 225 

Glu He Glu Ala Leu Thr Pro His Ser Ser Tyr Cys Val Val Ala 
230 235 240 

Glu He Tyr Gin Pro Met Leu Asp Arg Arg Ser Gin Arg Ser Glu 
245 250 255 

Glu Arg Cys Val Glu He Pro 
260 



<210> SEQ ID NO 3 
<211> LENGTH: 27 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide probe 
<400> SEQUENCE: 3 

ctggcaacag cagtgtctat tttgtgc 27 



<210> SEQ ID NO 4 
<211> LENGTH: 21 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide probe 



<400> SEQUENCE: 4 
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taagtgccct cccaggctgc c 21 

<210> SEQ ID NO 5 
<211> LENGTH: 52 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide probe 
<400> SEQUENCE: 5 

tcctccagtc atgaatataa cccaagtcaa tggctctttg ttggtaattc 50 
tc 52 

<210> SEQ ID NO 6 

<211> LENGTH: 1705 

<212> TYPE: DNA 

<213> ORGANISM: Homo Sapien 

<4 00> SEQUENCE: 6 



tgaaatgact 


tccacggctg 


ggacgggaac 


cttccaccca 


cagctatgcc 


50 


tctgattggt 


gaatggtgaa 


ggtgcctgtc 


taacttttct 


gtaaaaagaa 


100 


ccagctgcct 


ccaggcagcc 


agccctcaag 


catcacttac 


aggaccagag 


150 


ggacaagaca 


tgactgtgat 


gaggagctgc 


tttcgccaat 


ttaacaccaa 


200 


gaagaattga 


ggctgcttgg 


gaggaaggcc 


aggaggaaca 


cgagactgag 


250 


agatgaattt 


tcaacagagg 


ctgcaaagcc 


tgtggacttt 


agccagaccc 


300 


ttctgccctc 


ctttgctggc 


gacagcctct 


caaatgcaga 


tggttgtgct 


350 


cccttgcctg 


ggttttaccc 


tgcttctctg 


gagccaggta 


tcaggggccc 


400 


agggccaaga 


attccacttt 


gggccctgcc 


aagtgaaggg 


ggttgttccc 


450 


cagaaactgt 


gggaagcctt 


ctgggctgtg 


aaagacacta 


tgcaagctca 


500 


ggataacatc 


acgagtgccc 


ggctgctgca 


gcaggaggtt 


ctgcagaacg 


550 


tctcggatgc 


tgagagctgt 


taccttgtcc 


acaccctgct 


ggagttctac 


600 


ttgaaaactg 


ttttcaaaaa 


ccaccacaat 


agaacagttg 


aagtcaggac 


650 


tctgaagtca 


ttctctactc 


tggccaacaa 


ctttgttctc 


atcgtgtcac 


700 


aactgcaacc 


cagtcaagaa 


aatgagatgt 


tttccatcag 


agacagtgca 


750 


cacaggcggt 


ttctgctatt 


ccggagagca 


ttcaaacagt 


tggacgtaga 


800 


agcagctctg 


accaaagccc 


ttggggaagt 


ggacattctt 


ctgacctgga 


850 


tgcagaaatt 


ctacaagctc 


tgaatgtcta 


gaccaggacc 


tccctccccc 


900 


tggcactggt 


ttgttccctg 


tgtcatttca 


aacagtctcc 


cttcctatgc 


950 


tgttcactgg 


acacttcacg 


cccttggcca 


tgggtcccat 


tcttggccca 


1000 


ggattattgt 


caaagaagtc 


attctttaag 


cagcgccagt 


gacagtcagg 


1050 


gaaggtgcct 


ctggatgctg 


tgaagagtct 


acagagaaga 


ttcttgtatt 


1100 


tattacaact 


ctatttaatt 


aatgtcagta 


tttcaactga 


agttctattt 


1150 


atttgtgaga 


ctgtaagtta 


catgaaggca 


gcagaatatt 


gtgccccatg 


1200 


cttctttacc 


cctcacaatc 


cttgccacag 


tgtggggcag 


tggatgggtg 


1250 


cttagtaagt 


acttaataaa 


ctgtggtgct 


ttttttggcc 


tgtctttgga 


1300 


ttgttaaaaa 


acagagaggg 


atgcttggat 


gtaaaactga 


acttcagagc 


1350 
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atgaaaatca 


cactgtcttc 


tgatatctgc 


agggacagag 


cattggggtg 


1400 


ggggtaaggt 


gcatctgttt 


gaaaagtaaa 


cgataaaatg 


tggattaaag 


1450 


tgcccagcac 


aaagcagatc 


ctcaataaac 


atttcatttc 


ccacccacac 


1500 


tcgccagctc 


accccatcat 


ccctttccct 


tggtgccctc 


cttttttttt 


1550 


tatcctagtc 


attcttccct 


aatcttccac 


ttgagtgtca 


agctgacctt 


1600 


gctgatggtg 


acattgcacc 


tggatgtact 


atccaatctg 


tgatgacatt 


1650 


ccctgctaat 


aaaagacaac 


ataactccaa 


aaaaaaaaaa 


aaaaaaaaaa 


1700 


aaaaa 










1705 



<210> SEQ ID NO 7 

<211> LENGTH: 206 

<212> TYPE: PRT 

<213> ORGANISM: Homo Sapien 

<400> SEQUENCE: 7 

Met Asn Phe Gin Gin Arg Leu Gin Ser Leu Trp Thr Leu Ala Arg 
1 5 10 15 

Pro Phe Cye Pro Pro Leu Leu Ala Thr Ala Ser Gin Met Gin Met 
20 25 30 

Val Val Leu Pro Cys Leu Gly Phe Thr Leu Leu Leu Trp Ser Gin 
35 40 " 45 

Val Ser Gly Ala Gin Gly Gin Glu Phe His Phe Gly Pro Cys Gin 
50 55 60 

Val Lys Gly Val Val Pro Gin Lys Leu Trp Glu Ala Phe Trp Ala 
65 70 75 

Val Lys Asp Thr Met Gin Ala Gin Asp Asn lie Thr Ser Ala Arg 
80 85 90 

Leu Leu Gin Gin Glu Val Leu Gin Asn Val Ser Asp Ala Glu Ser 
95 100 105 

Cys Tyr Leu Val His Thr Leu Leu Glu Phe Tyr Leu Lys Thr Val 
110 115 120 

Phe Lys Asn His His Asn Arg Thr Val Glu Val Arg Thr Leu Lys 
125 130 135 

Ser Phe Ser Thr Leu Ala Asn Asn Phe Val Leu lie Val Ser Gin 
140 145 150 

Leu Gin Pro Ser Gin Glu Asn Glu Met Phe Ser lie Arg Asp Ser 
155 160 165 

Ala His Arg Arg Phe Leu Leu Phe Arg Arg Ala Phe Lys Gin Leu 
170 175 180 

Asp Val Glu Ala Ala Leu Thr Lys Ala Leu Gly Glu Val Asp lie 
185 190 195 



Leu Leu Thr Trp Met Gin Lys Phe Tyr Lys Leu 
200 205 



What is claimed is: 

1. An isolated nucleic acid molecule having at least 80% 
sequence identity to: 60 

(a) a nucleic acid sequence encoding the polypeptide 
shown as SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide 
shown as SEQ ID NO:2, lacking 65 

its associated signal peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO:l; 



(d) the full-length coding sequence of the nucleic acid 
sequence shown as SEQ ID NO:l; or 

(e) the full-length coding sequence of cDNA deposited 
under ATCC accession number PTA-1532; wherein 
said isolated nucleic acid encodes a polypeptide which 
is a receptor for and binds to the ligand polypeptide 
shown as SEQ ID NO:4. 

2. An isolated nucleic acid molecule having at least 85% 
sequence identity to: 
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(a) a nucleic acid sequence encoding the polypeptide 
shown as SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide 
shown as SEQ ID NO:2, lacking its associated signal 
peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO:l; 

(d) the full-length coding sequence of the nucleic acid 
sequence shown as SEQ ID NO: 1; or 

(e) the full-length coding sequence of cDNA deposited 
under ATCC accession number PTA-1532; wherein 
said isolated nucleic acid encodes a polypeptide which 
is a receptor for and binds to the ligand polypeptide 
shown as SEQ ID NO:4. 

3. An isolated nucleic acid molecule having at least 90% 
sequence identity to: 

(a) a nucleic acid sequence encoding the polypeptide 
shown as SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide 
shown as SEQ ID NO: 2, lacking its associated signal 
peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO:l; 

(d) the full-length coding sequence of the nucleic acid 
sequence shown as SEQ ID NO: 1; or 

(e) the full-length coding sequence of cDNA deposited 
under ATCC accession number PTA-1532; wherein 
said isolated nucleic acid encodes a polypeptide which 
is a receptor for and binds to the ligand polypeptide 
shown as SEQ ID NO:4. 

4. An isolated nucleic acid molecule having at least 95% 
sequence identity to: 

(a) a nucleic acid sequence encoding the polypeptide 
shown as SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide 
shown as SEQ ID NO:2, lacking its associated signal 
peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO:l; 

(d) the full-length coding sequence of the nucleic acid 
sequence shown as SEQ ID NO:l; or 

(e) the full-length coding sequence of cDNA deposited 
under ATCC accession number PTA-1532; wherein 
said isolated nucleic acid encodes a polypeptide which 
is a receptor for and binds to the ligand polypeptide 
shown as SEQ ID NO:4. 

5. An isolated nucleic acid molecule having at least 99% 
sequence identity to: 

(a) a nucleic acid sequence encoding the polypeptide 
shown as SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide 
shown as SEQ ID NO:2, lacking its associated signal 
peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO:l; 

(d) the full-length coding sequence of the nucleic acid 
sequence shown as SEQ ID NO:l; or 
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25 



30 



35 



40 



45 



50 
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(e) the full-length coding sequence of cDNA deposited 
under ATCC accession number PTA-1532; wherein 
said isolated nucleic acid encodes a polypeptide which 
is a receptor for and binds to the ligand polypeptide 
shown as SEQ ID NO:4. 

6. An isolated nucleic acid molecule comprising: 

(a) a nucleic acid sequence encoding the polypeptide 
shown as SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide 
shown as SEQ ID NO: 2, lacking its associated signal 
peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO:l; 

(d) the full-length coding sequence of the nucleic acid 
sequence shown as SEQ ID NO:l; or 

(e) the full-length coding sequence of cDNA deposited 
under ATCC accession number PTA-1532. 

7. The isolated nucleic acid molecule of claim 6 compris- 
ing a nucleic acid sequence encoding the polypeptide shown 
as SEQ ID NO:2. 

8. The isolated nucleic acid molecule of claim 6 compris- 
ing a nucleic acid sequence encoding the polypeptide shown 
as SEQ ID NO:2, lacking its associated signal peptide. 

9. The isolated nucleic acid molecule of claim 6 compris- 
ing the nucleic acid sequence shown as SEQ ID NO:l. 

10. The isolated nucleic acid molecule of claim 6 com- 
prising the full-length coding sequence of the nucleic acid 
sequence shown as SEQ ID NO:l. 

11. The isolated nucleic acid molecule of claim 6 com- 
prising the full-length coding sequence of cDNA deposited 
under ATCC accession number PTA-1532. 

12. An isolated nucleic acid molecule mat hybridizes to: 

(a) a nucleic acid sequence encoding the polypeptide 
shown as SEQ ID NO:2; 

(b) a nucleic acid sequence encoding the polypeptide 
shown as SEQ ID NO: 2, lacking its associated signal 
peptide; 

(c) the nucleic acid sequence shown as SEQ ID NO:l; 

(d) the full-length coding sequence of the nucleic acid 
sequence shown as SEQ ID NO:l; or 

(e) the full-length coding sequence of cDNA deposited 
under ATCC accession number PTA-1532; wherein 
said isolated nucleic acid encodes a polypeptide which 
is a receptor for and binds to the ligand polypeptide 
shown as SEQ ID NO:4. 

13. A vector comprising the isolated nucleic acid mol- 
ecule of claim 1. 

14. The vector of claim 13, wherein said isolated nucleic 
acid is operably linked to control sequences recognized by 
a host cell transformed with the vector. 

15. A host cell comprising the vector of claim 13. 

16. The host cell of claim 15, wherein said cell is a CHO 
cell, an E. coli or a yeast cell. 



